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Refine Search 



Search Results - 



Term 


Documents 


(3 NOT 4).PGPB,USPT,US0C,EPAB,JPAB,DWPI,TDBD. 


3 


(L3 N0TL4 ).PGPB,USPT,USOC,EPAB,JPAB,DWPI,TDBD. 


3 
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Database: 



Search: 



L5 



Search History 



DATE: Wednesday, November 15, 2006 Purge Queries Printable Copy Create Case 

Set Name Query Hit Count ^ 

side by side Name 

result set 

DB=PGPB, USPZ USOQEPABJPMDWPl TDBD; THES=ASSIGNEE; PLUR=YES; 
OP=AND 

L5 L3notL4 3 L5 

L4 L3 and (oedema or diarrhia) 7 L4 

L3 (porcine or swine) same (FUTl and polymorphism) 10 L3 

L2 Bosworth-Brad-T$.in. 8 L2 

LI B osworth-Brad . in . 1 LI 

END OF SEARCH HISTORY 
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Inventor Name Search Result 



Page 1 of 1 



Day : Wednesday 
Date: 11/15/2006 
Time: 16:38:50 



Inventor Name Search 

]:*iiter tlie first few Sellers of the Inventors [>a3t Niinie. 
Additionally, enter the first few letters of the kiventors Imtsi name. 



Last Name 



Vogeli 



First Name 

[peter ;:S^gi:p?-j 



To go back use Back button on your browser toolbar. 
Back to PALM I ASSIGNMENT I OASIS I Home page 



http://expoweb 1 :8002/cgi-bin/expoAnvInfo/invquery pi 
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Inventor Name Search Result 



Page 1 of 1 




Day : Wednesday 



Date: 11/15/2006 
Time: 16:38:50 

Inventor Name Search 

Enter tht llrst few letters of the laveritors L.ast Name. 
Additionally, enter the first. I\*w letters of the liiventor's j^'irsi name. 

Last Name First Name 

Bosworth \ [Brad 

To go back use Back button on your browser toolbar. 
Back to PALM J ASSIGNMENT I OASIS I Home page 
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ov^v^ivi^ ocon^ii ivcauiLS x^ctoiia lui rxyyiiK^aiiyjii u>'o-T-rz.^Jo aiiu ocan^ii xvcauit ua-u:7-on-r-^,. . x a^c i ui j;/ 



SCORE Search Results Details for Application 
09844268 and Search Result us-09-844-268- 
IZ.rge. 



Score Home Retrieve Application SCORE System 
Page List Overview 



SCORE Comments / 
FAQ Suggestions 



This page gives you Search Results detail for the Application 09844268 and Search Result us-09- 

844-268-12. rge. 

start 

Go Back to previous page 

GenCore version 5.1.9 
Copyright (c) 1993 - 2006 Biocceleration Ltd. 



DM nucleic - nucleic search, using sw model 
Run on: 



Title: 

Perfect score: 
Sequence : 



November 13, 2006, 16:38:38 ; Search time 7441 Seconds 

(without alignments) 
10905.695 Million cell updates/sec 

US-09-844-268-12 
1269 

1 ctcgagccatgtgggtcccc actcagaggaaaccctgcag 1269 



Scoring table: IDENTITY_NUC 

Gapop 10.0 , Gapext 1.0 

Searched: 6366136 seqs, 31973710525 residues 

Total number of hits satisfying, chosen parameters: 

Minimum DB seq length: 0 

Maximum DB seq length: 2000000000 

Post-processing: Minimum Match 0% 

Maximum Match 100% 
Listing first 45 summaries 



12732272 



Database 



GenEmbl : 



1: 


gb_env : * 


2: 


gb_j>at: * 


3: 


gb_ph : * 


4: 


gb_pl : * 


5: 


gb_pr : * 


6: 


gb_ro : * 


7: 


gb_sts : * 


8: 


gb_sy:* 


9: 


gb_un : * 


10 


: gb_vi : * 


11 


: gb_ov:* 


12 


: gb_htg : * 


13 


: gb_in:* 


14 


: gb_om : * 


15 


: gb_ba : * 
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ov^v^x>aj< ocaiL.li Avcauita i^ctaiia ivji r^yyn^aiiKjii u:7o-r^z,uo oiiu ocai»^ii avcsuil ua-uj^-o-t-r-^.., r a^c ^ 

Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 

SUMMARIES 



Result 



Query 



No. 


Score 


Match 


Length 


DB 


ID 


1 


'1269 


100 


. 0 


1269 


2 


BD085288 


2 


1269 


100 


. 0 


1269 


2 


AR360893 


3 


1269 


100 


. 0 


1269 


2 


AR771735 


4 


1269 


100 


. 0 


1269 


2 


AX752829 


5 


1264 .2 


99 


. 6 


1269 


2 


AR2003.99 


6 


1261 


99 


. 4 


1269 


14 


PIGFTF 


7 


1245.8 


98 


. 2 


3315 


14 


U70883 


8 


1245 


98 


.1 


2528 


14 


AF136896 


9 


1094.8 


86 


.3 


1098 


2 


AX029029 


10 


871 


68 


. 6 


199471 


12 


AC168093 


11 


864.4 


68 


.1 


1867 


14 


AF394920S2 


12 


823.6 


64 


. 9 


1083 


14 


AF18 64 65 


13 


808.6 


63 


. 7 


8174 


2 


AR075308 


14 


808.6 


63 


.7 


8174 


2 


134189 


15 


808.6 


63 


.7 


8549 


5 


AY923051 


16 


808.6 


63 


. 7 


37402 


5 


AC009002 


17 


806.8 


63 


. 6 


3373 


2 


141409 


18 


806.8 


63, 


. 6 


3373 


2 


AX552703 


19 


806.8 


63, 


. 6 


3373 


5 


HUMHBLOD 


20 


806.8 


63. 


. 6 


3373 


7 


G28569 


21 


806.8 


63 , 


. 6 


3376 


2 


CQ715321 


22 


805.4 


63 . 


.5 


190025 


12 


AC140145 


23 


793.4 


62 , 


. 5 


1199 


2 


AR039078 


24 


793.4 


62 . 


. 5 


179538 


12 


AC024740 


25 


792.4 


62 , 


, 4 


1167 


5 


DQ092446 


26 


792 


62 , 


, 4 


1166 


5 


AF045545 


27 


790.8 


62 . 


3 


1136 


2 


AR039080 


28 


790.8 


62 . 


3 


1268 


5 


HSFUTIMUT 


29 


789 


62 , 


2 


1159 


5 


BC074732 


30 


788.6 


62 . 


1 


1144 


5 


AY611628 


31 


787.6 


62 . 


1 


1194 


5 


HSA276886 


32 


785.8 


61 . 


9 


1155 


2 


AR070448 


33 


785.8 


61 . 


3 


1155 


2 


186204 


34 


784.4 


61. 


8 


1098 


5 


AB004859 


35 


784 .4 


61. 


8 


1098 


5 


AB004861 


36 


784 .4 


61. 


8 


1098 


5 


AB004862 


37 


784.4 


61. 


8 


1098 


5 


AF455028 


38 


784 .4 


61. 


8 


1098 


5 


DQ157279 


39 


784.4 


61. 


8 


1098 


5 


DQ321370 


40 


784.2 


61. 


8 


1322 


5 


AF080607 


41 


782.8 


61. 


7 


1098 


5 


AB004863 


42 


782.6 


61. 


7 


1155 


2 


AR006821 


43 


782.6 


61. 


7 


1155 


2 


AR069905 


44 


781 


61. 


5 


1312 


5 


AF080603 


45 


779.4 


61. 


4 


1323 


5 


AF080605 



Description 



BD085288 Methods a 
AR360893 Sequence 
AR771735 Sequence 
AX752829 Sequence 
AR200399 Sequence 
L50534 Pig (Sus sc 
U70883 Sus scrofa 
AF136896 Sus scrof 
AX029029 Sequence 
AC168093 Bos tauru 
AF394921 Bos tauru 
AF1864 65 Bos tauru 
AR075308 Sequence 
134189 Sequence 5 
AY923051 Homo sapi 
AC009002 Homo sapi 
141409 Sequence 2 
AX552703 Sequence 
M35531 Human GDP-L 
G28569 human STS S 
CQ715321 Sequence 

AC140145 Homo sapi 
AR039078 Sequence 

AC024740 Homo sapi 
DQ09244 6 Homo sapi 
AF045545 Hylobates ■ 
AR039080 Sequence 
Z69587 H. sapiens F 
BC074732 Homo sapi 
AY61162 8 Homo sapi 
AJ27688 6 Homo sapi 
AR070448 Sequence 
186204 Sequence 1 
AB004859 Homo sapi 
AB004861 Homo sapi 
AB004862 Homo sapi 
AF4 5502 8 Homo sapi 
DQ15727 9 Homo sapi 
DQ321370 Homo sapi 
AF080607 Macaca mu 
AB004863 Homo sapi 
AR006821 Sequence 
AR069905 Sequence 
AF080603 Pan trogl . 
AF080605 Gorilla g 



ALIGNMENTS 



RESULT 1 
BD085288 
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LOCUS 

DEFINITION 

ACCESSION 
VERSION 
KEYWORDS 
SOURCE 

ORGANISM 

REFERENCE 
AUTHORS 
TITLE 

JOURNAL 



COMMENT 



FEATURES 

source 



ORIGIN 



BD085288 1269 bp DNA linear PAT 27-AUG-2002 

Methods and compositions to identify swine genetically resistant to 
F18 E coli associated diseases. 
BD085288 

BD085288. 1 GI: 22 6308 98 
JP 2001521401-A/12. 
unidentified 
unidentified 
unclassified sequences. 
1 (bases 1 to 1269) 
Bosworth, B.T. and Vogeli,P. 

Methods and compositions to identify swine genetically resistant to 

F18 E coli associated diseases 

Patent: JP 2001521401-A 12 06-NOV-2001; 

BIOTECHNOLOGY RESEARCH AND DEVELOPMENT CORP, US DEPARTMENT OF 
AGRICULTURE, SWISS FEDERAL INSTITUTE OF TECHNOLOGY ZURICH 
OS Unidentified 

JP 2001521401-A/12 
06-NOV-2001 

20-MAY-1998 JP 1998550579 
20-MAY-1997 US 60/047181 
BRAD T BOSWORTH, PETER VOGELI 
C12Ql/68,C07K14/47 
Strandedness : Singles- 
Topology : Linear; 

Methods and compositions to identify swine genetically CC 

resistant to F18 E 
coli associated diseases 
Key Location/Qualifiers 
CDS 9. .1103. 

Location/Qualifiers 
1. .1269 

/organism="unidentif ied" 
/mol_type=" genomic DNA" 
/db xref="taxon: 32644" 



PN 
PD 
PF 
PR 
PI 
PC 
CC 
CC 
CC 

CC 
FH 
FT 



Query Match 100.0%; Score 1269; DB 2; 

Best Local Similarity 100.0%; Pred. No. 0; 
Matches 1269; Conservative 0; Mismatches 0; 



Length 12 69; 



Indels 



0; Gaps 



0; 



Qy 


1 


Db 


1 


Qy 


61 


Db 


61 


Qy 


121 


Db 


121 


Qy 


181 


Db 


181 


Qy 


241 


Db 


241 



CTCGAGCCATGTGGGTCCCCAGCCGCCGCCACCTCTGTCTGACCTTCCTGCTAGTCTGTG 60 

N I I I I I I I I I I I I M nil Illllllll IMIIII 

CTCGAGCCATGTGGGTCCCCAGCCGCCGCCACCTCTGTCTGACCTTCCTGCTAGTCTGTG 60 

TTTTAGCAGCAATTTTCTTCCTGAACGTCTATCAAGACCTCTTTTACAGTGGCTTAGACC 120 
I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II M I 
TTTTAGCAGCAATTTTCTTCCTGAACGTCTATCAAGACCTCTTTTACAGTGGCTTAGACC 120 

TGCTGGCCCTGTGTCCAGACCATAACGTGGTATCATCTCCCGTGGCCATATTCTGCCTGG 180 

I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I M I I I I M I I I I I I I I M I I I I II I I I I 
TGCTGGCCCTGTGTCCAGACCATAACGTGGTATCATCTCCCGTGGCCATATTCTGCCTGG 180 

CGGGCACGCCGGTACACCCCAACGCCTCCGATTCCTGTCCCAAGCATCCTGCCTCCTTTT 240 
i I I I I I I I I 11 I 11 I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 



I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I! 

CCGGGACCTGGACTATTTACCCGGATGGCCGGTTTGGGAACCAGATGGGACAGTATGCCA 300 
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ov^vyivj^ ocaiL/ii xvcauita i^cuuia lui r^p^iioauuii uj^onnz.uo mm ocan^u xvcauii ua-u:7-onn-^.,, ra5C ^ 
Qy 301 CGCTGCTGGCCCTGGCGCAGCTCAACGGCCGCCAGGCCTTCATCCAGCCTGCCATGCACG 360 

N I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I M I I I 

301 CGCTGCTGGCCCTGGCGCAGCTCAACGGCCGCCAGGCCTTCATCCAGCCTGCCATGCACG 360 
Qy 361 CCGTCCTGGCCCCCGTGTTCCGCATCACGCTGCCTGTCCTGGCGCCCGAGGTAGACAGGC 420 

I I N I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I M I I I 

361 CCGTCCTGGCCCCCGTGTTCCGCATCACGCTGCCTGTCCTGGCGCCCGAGGTAGACAGGC 420 

Qy 421 ACGCTCCTTGGCGGGAGCTGGAGCTTCACGACTGGATGTCCGAGGATTATGCCCACTTAA 480 

I I I I I I I I I M M I I I I i I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I 
421 ACGCTCCTTGGCGGGAGCTGGAGCTTCACGACTGGATGTCCGAGGATTATGCCCACTTAA 480 

Qy 481 AGGAGCCCTGGCTGAAGCTCACCGGCTTCCCCTGCTCCTGGACCTTCTTCCACCACCTCC 540 

N I I I t I I M I I I I I I M I I I M I I I I I I I I I I I I I I I I I I I M I I I M I I I M I I I I I I 

Db 481 AGGAGCCCTGGCTGAAGCTCACCGGCTTCCCCTGCTCCTGGACCTTCTTCCACCACCTCC 540 

Qy 541 GGGAGCAGATCCGCAGCGAGTTCACCCTGCACGACCACCTTCGGCAAGAGGCCCAGGGGG 600 

> N I I I I I I I M I I I I I I I I I I I I I I i I I I I I I I I I I I I I I I I I I I I I I I I I M I 

Db 541 GGGAGCAGATCCGCAGCGAGTTCACCCTGCACGACCACCTTCGGCAAGAGGCCCAGGGGG 600 

Qy 601 TACTGAGTCAGTTCCGTCTACCCCGCACAGGGGACCGCCCCAGCACCTTCGTGGGGGTCC 660 

IIIIMMII MMMIIIIIIIIItlMIIIIMIIIIIMIIlll 

Db 601 TACTGAGTCAGTTCCGTCTACCCCGCACAGGGGACCGCCCCAGCACCTTCGTGGGGGTCC 660 

Qy 661 ACGTGCGCCGCGGGGACTATCTGCGTGTGATGCCCAAGCGCTGGAAGGGGGTGGTGGGTG 720 

III III! IIIIIIIIIIMIIIMII II Ml I Mlllllll III llllll IIIIMMII 

Db 661 ACGTGCGCCGCGGGGACTATCTGCGTGTGATGCCCAAGCGCTGGAAGGGGGTGGTGGGTG 720 

Qy 721 ACGGCCGTTACCTCCAGCAGGCTATGGACTGGTTCCGGGCCCGATACGAAGCCCCCGTCT 780 

I I I N I I I I I I I I II II I I II II I I I II I I M I II II I II I II II II II II M 

Db 721 ACGGCCGTTACCTCCAGCAGGCTATGGACTGGTTCCGGGCCCGATACGAAGCCCCCGTCT 780 

Qy 781 TTGTGGTCACCAGCAACGGCATGGAGTGGTGCCGGAAGAACATCGACACCTCCCGGGGGG 840 

I I I I I I I I I M I I M I II I I I II I I II I II I I I II I I I II II I II II I II II M II M M 

Db : ' 781 TTGTGGTCACCAGCAACGGCATGGAGTGGTGCCGGAAGAACATCGACACCTCCCGGGGGG 840 
Qy 841 ACGTGATCTTTGCTGGCGATGGGCGGGAGGCCGCGCCCGCCAGGGACTTTGCGCTGCTGG 900 

f I I M I M II I II I I II II I I II I I I I II M II I I M II I II I II I I M II 

Db 841 ACGTGATCTTTGCTGGCGATGGGCGGGAGGCCGCGCCCGCCAGGGACTTTGCGCTGCTGG 900 

Qy 901 TGCAGTGCAACCACACCATCATGACCATTGGCACCTTCGGCTTCTGGGCCGCCTACCTGG 960 

I I i N M II II I I I I I II I II I M I I II II I I II II I II I M t I II I I I I I I II I I II II 
Db 901 TGCAGTGCTVACCACACCATCATGACCATTGGCACCTTCGGCTTCTGGGCCGCCTACCTGG 960 

Qy 961 CTGGTGGAGATACCATCTACTTGGCTAACTTCACCCTGCCCACTTCCAGCTTCCTGAAGA 1020 

t I I I I I M I I II II I I I M M I M I II M I I M II M II II 

Db 961 CTGGTGGAGATACCATCTACTTGGCTAACTTCACCCTGCCCACTTCCAGCTTCCTGAAGA 1020 

Qy 1021 TCTTTAAACCCGAGGCTGCCTTCCTGCCCGAGTGGGTGGGCATTAATGCAGACTTGTCTC 1080 

I I I N M II II I I It II II II II I II II I M M I I II I II II I I M II II II I II I II M 

Db 1021 TCTTTAAACCCGAGGCTGCCTTCCTGCCCGAGTGGGTGGGCATTAATGCAGACTTGTCTC 1080 

Qy 1081 CACTCCAGATGTTGGCTGGGCCTTGAACCAGCCAGGAGCCTTTCTGGAATAGCCTCGGTC 1140 

I II I I I M I II I I I I I I I I I I II II II I I I II I M II M II I II I 

Db 1081 CACTCCAGATGTTGGCTGGGCCTTGAACCAGCCAGGAGCCTTTCTGGAATAGCCTCGGTC 1140 

Qy 1141 AACCCAGGGCCAGCGTTATGGGTCTCCGGAAGCCCGAGTAACTTCCGGAGATGCTGGTGG 1200 

I I I I I I I I I I i I I M M I I I I II I I M II I I I I I M II I II II I I II I M II II I I II M 

Db 1141 AACCCAGGGCCAGCGTTATGGGTCTCCGGAAGCCCGAGTAACTTCCGGAGATGCTGGTGG 1200 

Qy 1201 TCCTGTAGCAGGCTGGACACTTATTTCAAGAGTGATTCTAATTGGCTGGACTCAGAGGAA 1260 
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ov^vyi\Xi ocm^^ii xvcauita j^cimia lui rvppin^auuii v/:7o^n^vjo aim ocmv^ii xvcauit ua-v:7-ot-r-^... r a^c ^ vji j;? 



Db 

Qy 

Db 



M I I I I I M I I I I I I I I I I M I I I I I I I I M I I I I I I I I I I I I I I I I I t I I I I I I I I I I I 

1201 TCCTGTAGCAGGCTGGACACTTATTTCAAGAGTGATTCTAATTGGCTGGACTCAGAGGAA 

1261 ACCCTGCAG 1269 

I I I I I I M I 
1261 ACCCTGCAG 12 69 



1260 



RESULT 2 
AR360893 
LOCUS 

DEFINITION 

ACCESSION 

VERSION 

KEYWORDS 

SOURCE 

ORGANISM 

REFERENCE 
AUTHORS 
TITLE 

JOURNAL 



FEATURES 

source 



ORIGIN 



AR360893 1269 bp DNA linear PAT 17-AUG-2003 

Sequence 12 from patent US 659 6923. 

AR360893 

AR360893.1 GI: 337 684 02 

Unknown . 

Unknown . 

Unclassified. 

1 (bases 1 to 1269) 

Bosworth, B.T. and Vogeli,P. 

Methods and compositions to identify swine genetically resistant to 
F18 E. coli associated diseases 
Patent: US 6596923-A 12 22-JUL-2003; 

Biotechnology Research & Development Corp., The United . States of 
America as represented by the Secretary of Agriculture and Swiss 
Federal Institute of Technology; Peoria, IL 

Location/Qualifiers 

1. .1269 

/ organism="unknoxvn" 
/mol_type="genomic DNA" 



Query Match 100.0%; Score 1269; DB 2; 

Best Local Similarity 100.0%; Pred. No. 0; 
Matches 1269; Conservative 0; Mismatches 0; 



Length 1269; 



Indels 



0; Gaps 



0; 



Qy 


1 


Db 


1 


Qy 


61 


Db 


61 


Qy 


121 


Db 


121 


Qy 


181 


Db 


181 


Qy 


241 


Db 


241 


Qy 


301 


Db 


301 


Qy 


361 



CTCGAGCCATGTGGGTCCCCAGCCGCCGCCACCTCTGTCTGACCTTCCTGCTAGTCTGTG 
I I I I I I I I I I I I I M M I I I I I I I I I I I I I I I I I I I I M I I I I II I I t I I I I I I I I M I I 
CTCGAGCCATGTGGGTCCCCAGCCGCCGCCACCTCTGTCTGACCTTCCTGCTAGTCTGTG 

TTTTAGCAGCAATTTTCTTCCTGAACGTCTATCAAGACCTCTTTTACAGTGGCTTAGACC 

IMIIIIIIIIIIIMMIIIIIIIMIIIIMIIII Illllllllll 

TTTTAGCAGCAATTTTCTTCCTGAACGTCTATCAAGACCTCTTTTACAGTGGCTTAGACC 

TGCTGGCCCTGTGTCCAGACCATAACGTGGTATCATCTCCCGTGGCCATATTCTGCCTGG 

ItllNIIII II I I I I I II I II I I I M I I I I II I II I I I II M I 

TGCTGGCCCTGTGTCCAGACCATAACGTGGTATCATCTCCCGTGGCCATATTCTGCCTGG 

CGGGCACGCCGGTACACCCCAACGCCTCCGATTCCTGTCCCAAGCATCCTGCCTCCTTTT 
I I I I I I i M I I II II II I I I I I I I I I II I M I II I I I I I I I I I I I I M I I I I I I II I I I I 
CGGGCACGCCGGTACACCCCAACGCCTCCGATTCCTGTCCCAAGCATCCTGCCTCCTTTT 



60 



60 



120 



120 



180 



180 



240 



240 



CCGGGACCTGGACTATTTACCCGGATGGCCGGTTTGGGAACCAGATGGGACAGTATGCCA 300 

I I I I I I I I I i I I I I I M M I I I I I I I I I I I I I 1! I I II I I II II I M i I I I I I I I I I I I I 

CCGGGACCTGGACTATTTACCCGGATGGCCGGTTTGGGAACCAGATGGGACAGTATGCCA 300 

CGCTGCTGGCCCTGGCGCAGCTCAACGGCCGCCAGGCCTTCATCCAGCCTGCCATGCACG 360 

NMIIIIIIIIIilllllllllll I IIIIIIMIIIIIIIIIIIIII 

CGCTGCTGGCCCTGGCGCAGCTCAACGGCCGCCAGGCCTTCATCCAGCCTGCCATGCACG 360 

CCGTCCTGGCCCCCGTGTTCCGCATCACGCTGCCTGTCCTGGCGCCCGAGGTAGACAGGC 420 
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Db 361 



I I > I I I I I I I I I I I I I I I M I I I I i I I I I I I I I I I I I I I I I I I I I I I I I i I I I I I I I I I I 
CCGTCCTGGCCCCCGTGTTCCGCATCACGCTGCCTGTCCTGGCGCCCGAGGTAGACAGGC 420 



Qy 421 ACGCTCCTTGGCGGGAGCTGGAGCTTCACGACTGGATGTCCGAGGATTATGGCCACTTAA 480 

I I I I I I I M I I I I I I I i I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

ACGCTCCTTGGCGGGAGCTGGAGCTTCACGACTGGATGTCCGAGGATTATGCCCACTTAA 480 



Db 421 



Qy AGGAGCCCTGGCTGAAGCTCACCGGCTTCCCCTGCTCCTGGACCTTCTTCCACCACCTCC 540 

I I N I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II , I , I 

AGGAGCCCTGGCTGAAGCTCACCGGCTTCCCCTGCTCCTGGACCTTCTTCCACCACCTCC 54 0 



Db 481 



Qy 541 GGGAGCAGATCCGCAGCGAGTTCACCCTGCACGACCACCTTCGGCAAGAGGCCCAGGGGG 600 

IIIIMIIIIIIIIIIIIIIIIMI I I I I I I M I I I I I I I I I I I I I I I 

GGGAGCAGATCCGCAGCGAGTTCACCCTGCACGACCACCTTCGGCAAGAGGCCCAGGGGG 600 



Db 541 



Qy 601 TACTGAGTCAGTTCCGTCTACCCCGCACAGGGGACCGCCCCAGCACCTTCGTGGGGGTCC 660 

IINMIIIIIIIIIIIIIIIIIIIMIIIIIIIMIIIIIIIIlliiiMIIIIMMI 

Db 601 TACTGAGTCAGTTCCGTCTACCCCGCACAGGGGACCGCCCCAGCACCTTCGTGGGGGTCC 660 

Qy 661 ACGTGCGCCGCGGGGACTATCTGCGTGTGATGCCCAAGCGCTGGAAGGGGGTGGTGGGTG 720 

N I I I I I I t I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I M I I I I I I 
661 ACGTGCGCCGCGGGGACTATCTGCGTGTGATGCCCAAGCGCTGGAAGGGGGTGGTGGGTG 720 



Qy 721 ACGGCCGTTACCTCCAGCAGGCTATGGACTGGTTCCGGGCCCGATACGAAGCCCCCGTCT 780 

IIIIMMIIMIIIIIIIMMII IIIIIIMIIIIIIIIIIIIIIIIIMIll 

ACGGCCGTTACCTCCAGCAGGCTATGGACTGGTTCCGGGCCCGATACGAAGCCCCCGTCT 780 



Db 721 



Qy 781 TTGTGGTCACCAGCAACGGCATGGAGTGGTGCCGGAAGAACATCGACACCTCCCGGGGGG 840 

"Mi I I I I i I M I I I I I I I I I I I t I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 781 TTGTGGTCACCAGCAACGGCATGGAGTGGTGCCGGAAGAACATCGACACCTCCCGGGGGG 840 

Qy 841 ACGTGATCTTTGCTGGCGATGGGCGGGAGGCCGCGCCCGCCAGGGACTTTGCGCTGCTGG 900 

I I N I I I I I I I I I I I I I I I I I I I t I I I M I I I I I I M I I I M M I I I I I I I I I I I I I I I I 

Db 841 ACGTGATCTTTGCTGGCGATGGGCGGGAGGCCGCGCCCGCCAGGGACTTTGCGCTGCTGG 900 



Qy 901 TGCAGTGCAACCACACCATCATGACCATTGGCACCTTCGGCTTCTGGGCCGCCTACCTGG 960 

INlllllllllllilll I M I I M I I I I I I I I I M I M I I I I M I M I I I I I I I 

TGCAGTGCAACCACACCATCATGACCATTGGCACCTTCGGCTTCTGGGCCGCCTACCTGG 960 



Db 901 



Qy 961 CTGGTGGAGATACCATCTACTTGGCTAACTTCACCCTGCCCACTTCCAGCTTCCTGAAGA 1020 

I I I N I I N I I I I I I I I I I I I I I I I I I M I I I I M I I I I I I I I I I I I I I I I M I I 

CTGGTGGAGATACCATCTACTTGGCTAACTTCACCCTGCCCACTTCCAGCTTCCTGAAGA 



Db 961 



1020 



Qy 1021 TCTTTAAACCCGAGGCTGCCTTCCTGCCCGAGTGGGTGGGCATTAATGCAGACTTGTCTC 1080 

Nlllfll i I I I I M I M I I I I I I I I I I M I I I I I I I I I I I M I I I I I I I I I I 

Db 1021 TCTTTAAACCCGAGGCTGCCTTCCTGCCCGAGTGGGTGGGCATTAATGCAGACTTGTCTC 1080 



Qy 1081 CACTCCAGATGTTGGCTGGGCCTTGAACCAGCCAGGAGCCTTTCTGGAATAGCCTCGGTC 114 0 

IIMNIMIIIIIIIIt I I I I I M I I I I I I I I I I I I I I I I I I I 

CACTCCAGATGTTGGCTGGGCCTTGAACCAGCCAGGAGCCTTTCTGGAATAGCCTCGGTC 114 0 



Db 1081 



Qy 1141 AACCCAGGGCCAGCGTTATGGGTCTCCGGAAGCCCGAGTAACTTCCGGAGATGCTGGTGG 1200 

I I I I M I I I I I I I I I M I ri I I I I I M I I M I I I I I I I I I I i I I I I I I I M I I I I I I I I I 

AACCCAGGGCCAGCGTTATGGGTCTCCGGAAGCCCGAGTAACTTCCGGAGATGCTGGTGG 1200 



Db 1141 



Qy 1201 TCCTGTAGCAGGCTGGACACTTATTTCAAGAGTGATTCTAATTGGCTGGACTCAGAGGAA 12 60 

IIIMIMIIIMIIMII Illlllllll Illllllllllllll 

TCCTGTAGCAGGCTGGACACTTATTTCAAGAGTGATTCTAATTGGCTGGACTCAGAGGAA 1260 



Db 1201 



Qy 1261 ACCCTGCAG 1269 

I i I I I I I I i 
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L^v^v^Avu, ocon^ii ivcduiia J^cuaiia lui r^yyu\^au\Jii yjyi^^-^^Kjo oiiu ocan^ii xvcauit ud-u:7-o-r-t-^... ra^c / j:7 

Db 1261 ACCCTGCAG 1269 

RESULT 3 
AR771735 

LOCUS AR771735 1269 bp DNA linear PAT 08-DEC-2005 

DEFINITION Sequence 12 from patent US 6965022. 

ACCESSION AR771735 • 

VERSION AR771735.1 GI:83347396 

KEYWORDS 

SOURCE Unknown. 

ORGANISM Unknown. 

Unclassified. 
REFERENCE 1 (bases 1 to 12 69) 

AUTHORS Bosworth, B.T. andVogeli,P. 

TITLE Methods to identify swine genetically resistant to F18 E. coli 

associated diseases 
JOURNAL Patent: US 6965022-A 12 15-NOV-2005; 

The United States of America as represented by the Secretary of 

Agriculture and Swiss Federal Institute of Technology; Washington, 

DC 

FEATURES Location/Qualifiers 
source 1. .1269 

/organism="unknown" 
/mol_type="genomic DNA" 

ORIGIN 

Query Match 100.0%; Score 1269; DB 2; Length 1269; 

Best Local Similarity 100.0%; Pred. No. 0; 

Matches 1269; Conservative 0; Mismatches 0; Indels 0; Gaps ' 0; 

CTCGAGCCATGTGGGTCCCCAGCCGCCGCCACCTCTGTCTGACCTTCCTGCTAGTCTGTG 60 

I > 1 N I I I M I I I I I I I I I I I [ I I I M I I I I I I I M I I I I I 

CTCGAGCCATGTGGGTCCCCAGCCGCCGCCACCTCTGTCTGACCTTCCTGCTAGTCTGTG 60 

TTTTAGCAGCAATTTTCTTCCTGAACGTCTATCAAGACCTCTTTTACAGTGGCTTAGACC 120 
I I I I N I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I t I I I I M I I I I I I I I 

TTTTAGCAGCAATTTTCTTCCTGAACGTCTATCAAGACCTCTTTTACAGTGGCTTAGACC 120 

TGCTGGCCCTGTGTCCAGACCATAACGTGGTATCATCTCCCGTGGCCATATTCTGCCTGG 180 
I I I I I I I I I I I I I I I I I I I I I I I I I I M M I I I I I I I I I I I I I I I I I I I I M I I I I I j I I 
TGCTGGCCCTGTGTCCAGACCATAACGTGGTATCATCTCCCGTGGCCATATTCTGCCTGG 180 

CGGGCACGCCGGTACACCCCAACGCCTCCGATTCCTGTCCCAAGCATCCTGCCTCCTTTT 240 
I I I I I I I I M I I I I I M I I I I I I I I I I I I I I I I I t I I I I I I I t I I I I I I I I I I I I I I I I I 
CGGGCACGCCGGTACACCCCAACGCCTCCGATTCCTGTCCCAAGCATCCTGCCTCCTTTT 240 

CCGGGACCTGGACTATTTACCCGGATGGCCGGTTTGGGAACCAGATGGGACAGTATGCCA 300 

I I I I I I I I I I I I I I II MM Nil MM ill I MM INI III I 

CCGGGACCTGGACTATTTACCCGGATGGCCGGTTTGGGAACCAGATGGGACAGTATGCCA 300 
CGCTGCTGGCCCTGGCGCAGCTCAACGGCCGCCAGGCCTTCATCCAGCCTGCCATGCACG 360 

I I M II I II II II I M I M I M II II II II II II II M I II II I II II I M I II M II II 

CGCTGCTGGCCCTGGCGCAGCTCAACGGCCGCCAGGCCTTCATCCAGCCTGCCATGCACG 360 
CCGTCCTGGCCCCCGTGTTCCGCATCACGCTGCCTGTCCTGGCGCCCGAGGTAGACAGGC 420 

UN MMMIMMMMMMMMMMM Illlllllllll 

CCGTCCTGGCCCCCGTGTTCCGCATCACGCTGCCTGTCCTGGCGCCCGAGGTAGACAGGC 420 

ACGCTCCTTGGCGGGAGCTGGAGCTTCACGACTGGATGTCCGAGGATTATGCCCACTTAA 480 
IIIMIIIIIIIIIIII IIIIMIIIIIIIIMIII IMIIIII 



Qy 


1 


Db 


1 


Qy 


61 


Db 


61 


Qy 


121 


Db 


121 


Qy 


181 


Db 


181 


Qy 


241 


Db 


241 


Qy 


301 


Db 


301 


Qy 


361 


Db 


361 


Qy 


421 
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ov^vyivu xvcsuita x^ctoiia lui ri.pynK.auun wyo^^^yja miu ocan.ii ivcauit ua-v:7-.o^n-^... ra^c o ui 

Db 421 ACGCTCCTTGGCGGGAGCTGGAGCTTCACGACTGGATGTCCGAGGATTATGCCCACTTAA 480 

Qy 481 AGGAGCCCTGGCTGAAGCTCACCGGCTTCCCCTGCTCCTGGACCTTCTTCCACCACCTCC 540 

I I I I I I N I I I I I I I I M M I I I I I I I I I I I I I I M I I I I I I I I I I I I i I I I I I I I I I I I 

Db 481 AGGAGCCCTGGCTGAAGCTCACCGGCTTCCCCTGCTCCTGGACCTTCTTCCACCACCTCC 540 

Qy 541 GGGAGCAGATCCGCAGCGAGTTCACCCTGCACGACCACCTTCGGCAAGAGGCCCAGGGGG 600 

I I I I I I M I I M I I I I I I I I I I I i I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I ,, I 

Db 541 GGGAGCAGATCCGCAGCGAGTTCACCCTGCACGACCACCTTCGGCAAGAGGCCCAGGGGG 600 

Qy 601 TACTGAGTCAGTTCCGTCTACCCCGCACAGGGGACCGCCCCAGCACCTTCGTGGGGGTCC 660 

I I I M I I I I I I I I I I M I I I I I I I M I I I I I I M I I M I I I I I I I I I I I I I I I M I I I I I 

Db 601 TACTGAGTCAGTTCCGTCTACCCCGCACAGGGGACCGCCCCAGCACCTTCGTGGGGGTCC 660 

Qy 661 ACGTGCGCCGCGGGGACTATCTGCGTGTGATGCCCAAGCGCTGGAAGGGGGTGGTGGGTG 720 

NIIMIIIMIIIIIIIIIIIIIMIMIIIIMIIIIIIIMIIIIMIIIIIIIIII 
Db 661 ACGTGCGCCGCGGGGACTATCTGCGTGTGATGCCCAAGCGCTGGAAGGGGGTGGTGGGTG 720 

Qy 721 ACGGCCGTTACCTCCAGCAGGCTATGGACTGGTTCCGGGCCCGATACGAAGCCCCCGTCT 780 

I I I N I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I 
Db 721 ACGGCCGTTACCTCCAGCAGGCTATGGACTGGTTCCGGGCCCGATACGAAGCCCCCGTCT 780 

Qy 781 TTGTGGTCACCAGCAACGGCATGGAGTGGTGCCGGAAGAACATCGACACCTCCCGGGGGG 840 

I I t I I I I I I I I I I I t I I I I I I I I I I I M I I I I M I I I I I I I t M I I I I I I I M I I I I I I I 

Db 781 TTGTGGTCACCAGCAACGGCATGGAGTGGTGCCGGAAGAACATCGACACCTCCCGGGGGG 840 

Qy 841 ACGTGATCTTTGCTGGCGATGGGCGGGAGGCCGCGCCCGCCAGGGACTTTGCGCTGCTGG 900 

I I I I I I I I I I I I i I I I I I I I I I I I I I I I II M I I I II I I II I I II II I II 

Db 841 ACGTGATCTTTGCTGGCGATGGGCGGGAGGCCGCGCCCGCCAGGGACTTTGCGCTGCTGG 900 

Qy TGCAGTGCAACCACACCATCATGACCATTGGCACCTTCGGCTTCTGGGCCGCCTACCTGG 960 

> I II M M M I II II I I I t I I II I I I II t I I I I I I 1! M I I II t I I II II I M I 

Db 901 TGCAGTGCAACCACACCATCATGACCATTGGCACCTTCGGCTTCTGGGCCGCCTACCTGG 960 

Qy 961 CTGGTGGAGATACCATCTACTTGGCTAACTTCACCCTGCCCACTTCCAGCTTCCTGAAGA 1020 

II MIIIIIIIIMIIIIIMMIIIIMIIIMMIIMIII I 

Db 961 CTGGTGGAGATACCATCTACTTGGCTAACTTCACCCTGCCCACTTCCAGCTTCCTGAAGA 1020 

Qy ' 1021 TCTTTAAACCCGAGGCTGCCTTCCTGCCCGAGTGGGTGGGCATTAATGCAGACTTGTCTC 1080 

IINM N I I I I I i I I I I I I I I I I I I I M I M I M I I I I I I I I I I I 

Db 1021 TCTTTAAACCCGAGGCTGCCTTCCTGCCCGAGTGGGTGGGCATTAATGCAGACTTGTCTC 1080 

Qy 1081 CACTCCAGATGTTGGCTGGGCCTTGAACCAGCCAGGAGCCTTTCTGGAATAGCCTCGGTC 1140 

Ill J i I I I I I I I I I I I I I I I I I I i I I I I I I I M I I I I I I I I M I I I I I I 

Db 1081 CACTCCAGATGTTGGCTGGGCCTTGAACCAGCCAGGAGCCTTTCTGGAATAGCCTCGGTC 1140 

Qy 1141 AACCCAGGGCCAGCGTTATGGGTCTCCGGAAGCCCGAGTAACTTCCGGAGATGCTGGTGG 1200 

ININIIill Mllllllllllllllll MIIIIMIIIIII 

Db 1141 AACCCAGGGCCAGCGTTATGGGTCTCCGGAAGCCCGAGTAACTTCCGGAGATGCTGGTGG 1200 

Qy 1201 TCCTGTAGCAGGCTGGACACTTATTTCAAGAGTGATTCTAATTGGCTGGACTCAGAGGAA 1260 

Illlll lilllMllllllliMIIIIIIIII IIIIIIIIIII 

Db 1201 TCCTGTAGCAGGCTGGACACTTATTTCAAGAGTGATTCTAATTGGCTGGACTCAGAGGAA 1260 

Qy 1261 ACCCTGCAG 1269 

I I I I I II I I 
Db 1261 ACCCTGCAG 1269 

RESULT 4 
AX752829 
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LOCUS 

DEFINITION 

ACCESSION 

VERSION 

KEYWORDS 

SOURCE 

ORGANISM 



REFERENCE 
AUTHORS 
TITLE 

JOURNAL 



FEATURES 

source 



gene 
CDS 



AX752829 1269 bp itiRNA 

Sequence 12 from Patent EP1310570. 
AX752829 

AX752829.1 GI: 32134722 



linear PAT 20-JUN-2003 



Euteleostomi; 
Suina; Suidae; 



ORIGIN 



Sus scrofa (pig) 
Sus scrofa 

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; 
Mammalia; Eutheria; Laurasiatheria; Cetartiodactyla; 
Sus . 
1 

Bosworth, B.T. and Voegeli, P . 

Methods and compositions to identify swine genetically resistant to 
F18 E. coli associated diseases 
Patent: EP 1310570-A 12 14-MAY-2003; 

Biotechnology Research and Development Corporation (US); U.S. 
Department Of Agriculture (US) ; Swiss Federal Institute of 
Technology Zurich (CH) 

Location/Qualifiers 

1. .1269 

/organism="Sus scrofa" 
/mol_type="mRNA" 
/db_xref="taxon: 9823" 
1. .1269 
/gene="FUTl" 
9. .1106 
/gene="FUTl" 
/codon_start=l 

/product="alpha (1,2) fucosyltransf erase" 
/protein_id="CAD99177 .1" 
/db_xref ="GI : 32134723 " 

/translation="MWVPSRRHLCLTFLLVCVLAAIFFLNVYQDLFYSGLDLLALCPD 

HNWSSPVAIFCLAGTPVHPNASDSCPKHPASFSGTWTIYPDGRFGNQMGQYATLLAL 

AQLNGRQAFIQPAMHAVLAPVFRITLPVLAPEVDRHAPWRELELHDWMSEDYAHLKEP 

WLKLTGFPCSWTFFHHLREQIRSEFTLHDHLRQEAQGVLSQFRLPRTGDRPSTFVGVH 

VRRGDYLRVMPKRWKGWGDGRYLQQAMDWFRARYEAPVFWTSNGMEWCRKNIDTSR 

GDVIFAGDGREAAPARDFALLVQCNHTIMTIGTFGFWAAYLAGGDTIYLANFTLPTSS 
FLKI FKPEAAFL PEWVGINADLS PLQMLAGP " 



Query Match 100.0%; Score 1269; 

Best Local Similarity 100.0%; Pred. No. 0; 
Matches 1269; Conservative 0; Mismatches 



DB 2; Length 12 69; 
0; Indels 0; 



Gaps 



0; 



Qy 


1 


Db 


1 


Qy 


61 


Db 


61 


Qy 


121 


Db 


121 


Qy 


181 


Db 


181 


Qy 


241 



CTCGAGCCATGTGGGTCCCCAGCCGCCGCCACCTCTGTCTGACCTTCCTGCTAGTCTGTG 60 

I I I I I I M I I I I I I I I I M IN I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

CTCGAGCCATGTGGGTCCCCAGCCGCCGCCACCTCTGTCTGACCTTCCTGCTAGTCTGTG 60 

TTTTAGCAGCAATTTTCTTCCTGAACGTCTATCAAGACCTCTTTTACAGTGGCTTAGACC 120 

I I I N I I t I I I I I I I I I I I M I N t I I I I I M I I N I I I I I I M I M I I I I I I I I I I I I I 

TTTTAGCAGCAATTTTCTTCCTGAACGTCTATCAAGACCTCTTTTACAGTGGCTTAGACC 120 
TGCTGGCCCTGTGTCCAGACCATAACGTGGTATCATCTCCCGTGGCCATATTCTGCCTGG 180 

IIIMIMItlllllMlllltllllllllllMI IIIIIIIIIMIIIIIII 

TGCTGGCCCTGTGTCCAGACCATAACGTGGTATCATCTCCCGTGGCCATATTCTGCCTGG 180 



I Illlllilllll illlMIII illllll II 

CGGGCACGCCGGTACACCCCAACGCCTCCGATTCCTGTCCCAAGCATCCTGCCTCCTTTT 240 

CCGGGACCTGGACTATTTACCCGGATGGCCGGTTTGGGAACCAGATGGGACAGTATGCCA 300 
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Db 241 



t I I I M M I I I I I I I I I M I I I I I I I M I I M I I I I I M I I I M I I M I I I I I I I I I i I I 

CCGGGACCTGGACTATTTACCCGGATGGCCGGTTTGGGAACCAGATGGGACAGTATGCCA 300 



Qy 301 CGCTGCTGGCCCTGGCGCAGCTCAACGGCCGCCAGGCCTTCATCCAGCCTGCCATGCACG 360 

N I I I I I I I I I t I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I M I 
CGCTGCTGGCCCTGGCGCAGCTCAACGGCCGCCAGGCCTTCATCCAGCCTGCCATGCACG 360^ 



Db 301 



Qy 361 CCGTCCTGGCCCCCGTGTTCCGCATCACGCTGCCTGTCCTGGCGCCCGAGGTAGACAGGC 420 

llllllfllMIIIMIIIIIIIIIIIMIIIIIIIlliiiiMIIIMIIIIIIIIIII 

Db 361 CCGTCCTGGCCCCCGTGTTCCGCATCACGCTGCCTGTCCTGGCGCCCGAGGTAGACAGGC 420 

Qy 421 ACGCTCCTTGGCGGGAGCTGGAGCTTCACGACTGGATGTCCGAGGATTATGCCCACTTAA 480 

I N I I I I I I i I I M M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I II , I 

421 ACGCTCCTTGGCGGGAGCTGGAGCTTCACGACTGGATGTCCGAGGATTATGCCCACTTAA 480 



Qy 481 AGGAGCCCTGGCTGAAGCTCACCGGCTTCCCCTGCTCCTGGACCTTCTTCCACCACCTCC 540 

I I I N I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I 

AGGAGCCCTGGCTGAAGCTCACCGGCTTCCCCTGCTCCTGGACCTTCTTCCACCACCTCC 540 



Db 481 



Qy 541 GGGAGCAGATCCGCAGCGAGTTCACCCTGCACGACCACCTTCGGCAAGAGGCCCAGGGGG 600 

INIII IIIM IIMIIIMIIMIIIMIIIIIIMIIMIII 

Db 541 GGGAGCAGATCCGCAGCGAGTTCACCCTGCACGACCACCTTCGGCAAGAGGCCCAGGGGG 600 

Qy 601 TACTGAGTCAGTTCCGTCTACCCCGCACAGGGGACCGCCCCAGCACCTTCGTGGGGGTCC 660 

N M I I I I I I I I I t I I I I I I I I I M I I I I I I I I I I II II I I I I I I I I I I I I I M I M I I I 

Db 601 TACTGAGTCAGTTCCGTCTACCCCGCACAGGGGACCGCCCCAGCACCTTCGTGGGGGTCC 660 

Qy 661 ACGTGCGCCGCGGGGACTATGTGCGTGTGATGCCCAAGCGCTGGAAGGGGGTGGTGGGTG 720 

I I I I I t I I I I I I I M I I I M I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I M I I I 

ACGTGCGCCGCGGGGACTATCTGCGTGTGATGCCCAAGCGCTGGAAGGGGGTGGTGGGTG 720 



Qy 721 ACGGCCGTTACCTCCAGCAGGCTATGGACTGGTTCCGGGCCCGATACGAAGCCCCCGTCT 780 

IIINIIIIMIIIIIIIIIIIIIMIIIMIIIIIItllllllMIIIMIIIMlMi 

ACGGCCGTTACCTCCAGCAGGCTATGGACTGGTTCCGGGCCCGATACGAAGCCCCCGTCT 780 



Db 721 



Qy 781 TTGTGGTCACCAGCAACGGCATGGAGTGGTGCCGGAAGAACATCGACACCTCCCGGGGGG 840 

I I I I I I I I I I M I I I I I I I I I i I I I I I I I I I I I I I I M I I I I I I I I I I I I I M I 

781 TTGTGGTCACCAGCAACGGCATGGAGTGGTGCCGGAAGAACATCGACACCTCCCGGGGGG 840 

Qy 841 ACGTGATCTTTGCTGGCGATGGGCGGGAGGCCGCGCCCGCCAGGGACTTTGCGCTGCTGG 900 

N I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I 

Db 841 ACGTGATCTTTGCTGGCGATGGGCGGGAGGCCGCGCCCGCCAGGGACTTTGCGCTGCTGG 900 

Qy 901 TGCAGTGCAACCACACCATCATGACCATTGGCACCTTCGGCTTCTGGGCCGCCTACCTGG 960 

I I N I I I I I I I I I I I I I I t I I I I M I I I I I I I I I I M I I 

Db 901 TGCAGTGCAACCACACCATCATGACCATTGGGACCTTCGGCTTCTGGGCCGCCTACCTGG 960 

Qy 961 CTGGTGGAGATACCATCTACTTGGCTAACTTCACCCTGCCCACTTCCAGCTTCCTGAAGA 1020 

I I I N I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I 

Db 961 CTGGTGGAGATACCATCTACTTGGCTAACTTCACCCTGCCCACTTCCAGCTTCCTGAAGA 1020 

Qy 1021 TCTTTAAACCCGAGGCTGCCTTCCTGCCCGAGTGGGTGGGCATTAATGCAGACTTGTCTC 1080 

I I I I N I I I I I I I I I I I I I I I I I I I I I I I I I I II I I II I I II I I II II II I II II I I I II 
Db 1021 TCTTTAAACCCGAGGCTGCCTTCCTGCCCGAGTGGGTGGGCATTAATGCAGACTTGTCTC 1080 



Qy 1081 CACTCCAGATGTTGGCTGGGCCTTGAACCAGCCAGGAGCCTTTCTGGAATAGCCTCGGTC 1140 

I N i I I I I I I I I I II II II I I I I I I I I I I I I I I I I I I I II I I II I II M I 

CACTCCAGATGTTGGCTGGGCCTTGAACCAGCCAGGAGCCTTTCTGGAATAGCCTCGGTC 1140 



Db 1081 



Qy 1141 AACCCAGGGCCAGCGTTATGGGTCTCCGGAAGCCCGAGTAACTTCCGGAGATGCTGGTGG 1200 
mil I I II I I I I I I II I I I I I I I I I M I I I II I I I I I I I II I I I I I I I I I I M 
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Db 

Qy 

Db 

Qy 

Db 



1141 AACCCAGGGCCAGCGTTATGGGTCTCCGGAAGCCCGAGTAACTTCCGGAGATGCTGGTGG 1200 



1201 



1201 



TCCTGTAGCAGGCTGGACACTTATTTCAAGAGTGATTCTAATTGGCT^GGACTCAGAGGAA 

N NMIIIIMIllliiMiiiiiMIMIIIIMMII 

TCCTGTAGCAGGCTGGACACTTATTTCAAGAGTGATTCTAATTGGCTGGACTCAGAGGAA 



1260 



1260 



1261 ACCCTGCAG 1269 

I I I I I I I I I 
1261 ACCCTGCAG 1269 



. RESULT 5 
AR200399 
LOCUS 

DEFINITION 

ACCESSION 

VERSION 

KEYWORDS 

SOURCE 

ORGANISM 

REFERENCE 
AUTHORS 
TITLE 

JOURNAL 
FEATURES 

source 



ORIGIN 



AR200399 1269 bp DNA 

Sequence 1 from patent US 6355859. 
AR200399 

AR200399.1 GI: 20250473 



linear PAT 20-APR-2002 



Unknown. 

Unknown . 

Unclassified. 

1 (bases 1 to 1269) 

Bosworth,B., Ridpath^J. and Wiseman, B. 

Interactions between genotype and diet in swine that prevent E. 
coli associated intestinal disease 
Patent: US 6355859-A 1 12-MAR-2002; 

Location/Qualifiers 

1. .1269 

/organism="unknown" 
/mol_type="unassigned DNA" 



Query Match 99.6%; 
Best Local Similarity 99.8%; 
Matches 1266; Conservative 



Score 12 64.2; DB 2; 
Pred. No. 9.4'e-313; 
0; Mismatches 3; 



Length 1269; 



Indels 



0; Gaps 



0; 



Qy 

Db 

Qy 

Db 

Qy 

Db 

Qy 

Db 

Qy 

Db 

Qy 

Db 

Qy 



1 CTCGAGCCATGTGGGTCCCCAGCCGCCGCCACCTCTGTCTGACCTTCCTGCTAGTCTGTG 
N I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I t I I I I I I I I I I I I I I I I I I I I I I I 
1 CTCGAGCCATGTGGGTCCCCAGCCGCCGCCACCTCTGTCTGACCTTCCTGCTAGTCTGTG 

61 TTTTAGCAGCAATTTTCTTCCTGAACGTCTATCAAGACCTCTTTTACAGTGGCTTAGACC 
> I N I I I I I I I M I I I I I I I t I I I I I I I I I I I I I I I I I I M I i I I I I I M I I I I I I M I I 
61 TTTTAGCAGCAATTTTCTTCCTGAACGTCTATCAAGACCTCTTTTACAGTGGCTTAGACC 

121 TGCTGGCCCTGTGTCCAGACCATAACGTGGTATCATCTCCCGTGGCCATATTCTGCCTGG 

N I M I I I I I I i I I I I I I I I I I I I I I I I I M I I I I I I M I I I I I I I I I I I I I I I I M I I I 

121 TGCTGGCCCTGTGTCCAGACCATAACGTGGTATCATCTCCCGTGGCCATATTCTGCCTGG 

181 CGGGCACGCCGGTACACCCCAACGCCTCCGATTCCTGTCCCAAGCATCCTGCCTCCTTTT 
I I N I I I I I I M I I I I I I t I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I M I I I I I 
181 CGGGCACGCCGGTACACCCCAACGCCTCCGATTCCTGTCCCAAGCATCCTGCCTCCTTTT 

241 CCGGGACCTGGACTATTTACCCGGATGGCCGGTTTGGGAACCAGATGGGACAGTATGCCA 

IIMIIIillll Ill I I II I II II II II II I II II II II I I I I II II 

241 CCGGGACCTGGACTATTTACCCGGATGGCCGGTTTGGGAACCAGATGGGACAGTATGCCA 

301 



60 



60 



120 



120 



180 



180 



240 



240 



300 



300 



CGCTGCTGGCCCTGGCGCAGCTCAACGGCCGCCAGGCCTTCATCCAGCCTGCCATGCACG 360 

IIMIIIIMIII I I I I I I I I I I I I I I II Illllllllllll 

301 CGCTGCTGGCCCTGGCGCAGCTCAACGGCCGCCAGGCCTTCATCCAGCCTGCCATGCACG 360 

361 CCGTCCTGGCCCCCGTGTTCCGCATCACGCTGCCTGTCCTGGCGCCCGAGGTAGACAGGC 420 
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ov^v^ivu, ocaicii ivcauita i^ctoiia loi r^yyii\,aii\jii u:7o-rnz,uo aim ocaiL.ii xvcauit ua-u^- 



ra^c iz* m j:7 



Db 361 



I N I I I I I I I I I I I I I I I I I M I I I M I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I M 
CCGTCCTGGCCCCCGTGTTCCGCATCACGCTGCCTGTCCTGGCGCCCGAGGTAGACAGGC 420 



Qy 421 ACGCTCCTTGGCGGGAGCTGGAGCTTCACGACTGGATGTCCGAGGATTATGCCCACTTAA 480 

I M I I I I I t I I I I I I I I I I I I I I I I I I t I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 421 ACGCTCCTTGGCGGGAGCTGGAGCTTCACGACTGGATGTCCGAGGATTATGCCCACTTAA 480 

Qy AGGAGCCCTGGCTGAAGCTCACCGGCTTCCCCTGCTCCTGGACCTTCTTCCACCACCTCC 540 

I I M I I I I 11 I I I I I I I I I I I I I I I I M M I I I I I I I I I I I I I M I I I I I I I I i I I I I I I 

Db 481 AGGAGCCCTGGCTGAAGCTCACCGGCTTCCCCTGCTCCTGGACCTTCTTCCACCACCTCC 540 



Qy 541 GGGAGCAGATCCGCAGCGAGTTCACCCTGCACGACCACCTTCGGCAAGAGGCCCAGGGGG 600 

I I I I M I I I I I I I M I I I I I I I I ri I I I I I I I I I M I I I I I I I I I I I I M I I I I I I I I I I 

GGGAGCAGATCCGCAGCGAGTTCACCCTGCACGACCACCTTCGGCAAGAGGCCCAGGGGG 600 



Db 541 



Qy 601 TACTGAGTCAGTTCCGTCTACCCCGCACAGGGGACCGCCCCAGCACCTTCGTGGGGGTCC 660 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I M I I I I I I I I I I I I M 

Db 601 TACTGAGTCAGTTCCGTCTACCCCGCACAGGGGACCGCCCCAGCACCTTCGTGGGGGTCC 660 

Qy ^61 ACGTGCGCCGCGGGGACTATCTGCGTGTGATGCCCAAGCGCTGGAAGGGGGTGGTGGGTG 720 

I > M I I I I I I I I I I I I I I I I M I M I I I I I I I I I I I I I M I I I I I I I I I I I I M I I I I I I 

ACGTGCGCCGCGGGGACTATCTGCGTGTGATGCCCAAGCGCTGGAAGGGGGTGGTGGGTG 720 



Qy 721 ACGGCCGTTACCTCCAGCAGGCTATGGACTGGTTCCGGGCCCGATACGAAGCCCCCGTCT 780 

I I I I I M I M I I I I I I I I I I I I M I I I I I I I I I I I M I I I I I I I I I I I I I I M I I I M 

ACGGCGCTTACCTCCAGCAGGCTATGGACTGGTTCCGGGCCCGATACGAAGCCCCCGTCT 780 



Db 721 



Qy 781 TTGTGGTCACCAGCAACGGCATGGAGTGGTGCCGGAAGAACATCGACACCTCCCGGGGGG 840 

> N I I I I I I I I I t I I I I I I I I I I I I I i I I I I I I I I I I I I I M t I I I I I I t I I I I I I M I I 
TTGTGGTCACCAGCAACGGCATGGAGTGGTGCCGGAAGAACATCGACACCTCCCGGGGGG 840 



Db 781 



Qy 841 ACGTGATCTTTGCTGGCGATGGGCGGGAGGCCGCGCCCGCCAGGGACTTTGCGCTGCTGG 900 

I N I I I M I I I I I I I I I I I I I M I I I I 11 I I I I I I I I I I I I I I I I I M I N I I I 

Db 841 ACGTGATCTTTGCTGGCGATGGGCGGGAGGCCGCGCCCGCCAGGGACTTTGCGCTGCTGG 



900 



Qy 501 TGCAGTGCAACCACACCATCATGACCATTGGCACCTTCGGCTTCTGGGCCGCCTACCTGG 960 

I I I I I M I I t I I I I I I I I I I I I i I I I I I I I I I I I I I I I I I I I I I M I I I I I M I I I I I I I 
TGCAGTGCAACCACACCATCATGACCATTGGCACCTTCGGCTTCTGGGCCGCCTACCTGG 9 60 



Db 901 



Qy 561 CTGGTGGAGATACCATCTACTTGGCTAACTTCACCCTGCCCACTTCCAGCTTCCTGAAGA 1020 

III i IIMIilllMlllllllllllillliilllllllllilliii 

Db 961 CTGGTGGAGATACCATCTACTTGGCTAACTTCACCCTGCCCACTTCCAGCTTCCTGAAGA 1020 



Qy 1021 TCTTTAAACCCGAGGCTGCCTTCCTGCCCGAGTGGGTGGGCATTAATGCAGACTTGTCTC 1080 

' Mllllllllllll IIIIIIIIIIIIMIMI Ml 

TCTTTAAACCCGAGGCTGCCTTCCTGCCCGAGTGGGTGGGCATTAATGCAGACTTGTCTC 1080 



Db 1021 



Qy 1081 CACTCCAGATGTTGGCTGGGCCTTGAACCAGCCAGGAGCCTTTCTGGAATAGCCTCGGTC 1140 

llllllllllllllllllllllllll i I M I I I I I I I I t I I I I Ill 

Db 1081 CACTCCAGATGTTGGCTGGGCCTTGAACCAGCCAGGAGCCTTTCTGGAATAGCCTCGGTC 1140 



Qy 1141 AACCCAGGGCCAGCGTTATGGGTCTCCGGAAGCCCGAGTAACTTCCGGAGATGCTGGTGG 1200 
I II II I I II I I Hi Mill I III INI III I III I I Mill Mill MM II II 

AACCCAGGGCCAGCGTTATGGGTCTCCGGAAGCCCCAGTAACTTCCGGAGATGCTGGTGG 1200 



Db 1141 



Qy 1201 TCCTGTAGCAGGCTGGACACTTATTTCAAGAGTGATTCTAATTGGCTGGACTCAGAGGAA 1260 

I I I I I I I I I I I I I I I II I I I II I I II I II M I I I II M II I I II M II II II M i I I M I 

Db 1201 TCCTGTAGCAGGCTGGACACTTATTTCAAGAGTGATTCTAATTGGCTGGACTCAGAGGAA 1260 

Qy 1261 ACCCTGCAG 1269 

I I I I I II II 
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x'a^c u ui 



Db 



1261 ACCCTGCAG 12 69 



RESULT 6 

PIGFTF 

LOCUS 

DEFINITION 

ACCESSION 

VERSION 

KEYWORDS 

SOURCE 

ORGANISM 



REFERENCE 
AUTHORS 
TITLE 

JOURNAL 
PUBMED 
COMMENT 
FEATURES 

source 



5'UTR 
CDS 



3'UTR 
ORIGIN 



^^^^^^ 1269 bp DNA linear MAM 27-JUN-1996 

Pig (Sus scrofa) alphal,2 fucosyltransf erase gene, complete cds 
L50534 

L50534.1 GI:1395136 

fucosyltransf erase; glycosyltransf erase . 
Sus scrofa (pig) 
Sus scrofa 

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; 
Mammalia; Eutheria; Laurasiatheria; Cetartiodactyla; Suina; Suidae; 
Sus . 

1 (bases 1 to 1269) 

Cohney,S., Mouhtouris, E , , McKenzie, I . F. and Sandrin,M.S. 

Molecular cloning of the gene coding for pig 

alphal — >2 fucosyltransf erase 

Immunogenetics 44 (1), 76-79 (1996) 

8613146 

Original source text: Sus scrofa (tissue library: genomic) DNA. 
Location/Qualifiers 
1. .1269 

/organism="Sus scrofa" 
/mol_type="genomic DNA" 
/db_xref="taxon: 9823" 
/cell_type= " liver " 
/ ti s sue_lib= " genomic " 
1. .8 
9. .1106 
/codon_start=l 

/product="alpha-l, 2-fucosyltransf erase" 
/protein_id="AAB02984 .1" 
/db_xref="GI : 1395137 " 

/translation="MWVPSRRHLCLTFLLVCVLAAIFFLNVYQDLFYSGLDLLALCPD 

HNWSSPVAIFCLAGTPVHPNASDSCPKHPASFSGTWTIYPDGRFGNQMGQYATLLAL 

AQLNGRQAFIQPAMHAVLAPVFRITLPVLAPEVDRHAPWRELELHDWMSEDYAHLKEP 

WLKLTGFPCSWTFFHHLREQIRSEFTLHDHLRQEAQGVLSQFRLPRTGDRPSTFVGVH 

VRRGDYLRVMPKRWKGWGDGAYLQQAMDWFRARYEAPVFWTSNGMEWCRKNIDTSR 

GDVIFAGDGREAAPARDFALLVQCNHTIMTIGTFGFWAAYLAGGDTIYLANFTLPTSS 

FLKIFKPEAAFLPEWVGINADLSPLQMLAGP" 

1107. .1269 



Query Match 99.4%; 
Best Local Similarity 99.6%; 
Matches 1264; Conservative 



Score 1261; DB 14; 
Pred. No. 5.7e-312; 
0; Mismatches 5; 



Length 12 69; 



Indels 



0; Gaps 



0; 



Qy 

Db 

Qy 

Db 

Qy 

Db 



1 CTCGAGCCATGTGGGTCCCCAGCCGCCGCCACCTCTGTCTGACCTTCCTGCTAGTCTGTG 

I) >l mill IINIIMIIIIIIIlMiiiiiMiiiiiiiiiiiii 

1 CTGCAGCCATGTGGGTCCCCAGCCGCCGCCACCTCTGTCTGACCTTCCTGCTAGTCTGTG 



61 



61 



121 



TTTTAGCAGCAATTTTCTTCCTGAACGTCTATCAAGACCTCTTTTACAGTGGCTTAGACC 

II'IIIIIIMIIIIIIIINIIIIIIM IIIIIMIIII t Ill 

TTTTAGCAGCAATTTTCTTCCTGAACGTCTATCAAGACCTCTTTTACAGTGGCTTAGACC 



TGCTGGCCCTGTGTCCAGACCATAACGTGGTATCATCTCCCGTGGCCATATTCTGCCTGG 

IIIIMIIMIMIIIIIIIIII Illililll Illlllllllllll 

121 TGCTGGCCCTGTGTCCAGACCATAACGTGGTATCATCTCCCGTGGCCATATTCTGCCTGG 



60 



60 



120 



120 



180 



180 
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■ a^c m ui jy 



Qy 181 CGGGCACGCCGGTACACCCCAACGCCTCCGATTCCTGTCCCAAGCATCCTGCCTCCTTTT 240 

I > I I I I I I I I I I I I I I I I I illllMlllllllllllllllllllllllllllii 

CGGGCACGCCGGTACACCCCAACGCCTCCGATTCCTGTCCCAAGCATCCTGCCTCCTTTT 240 



Db 181 



Qy 241 CCGGGACCTGGACTATTTACCCGGATGGCCGGTTTGGGAACCAGATGGGACAGTATGCCA 300 

I I I I M I I I I I I I I I I M I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I 

CCGGGACCTGGACTATTTACCCGGATGGCCGGTTTGGGAACCAGATGGGACAGTATGCCA 300 



Db 241 



Qy 301 CGCTGCTGGCCCTGGCGCAGCTCAACGGCCGCCAGGCCTTCATCCAGCCTGCCATGCACG 360 

Ml Ill I II III III II III III II I I I t I I I I I I I I t I I I 

301 CGCTGCTGGCCCTGGCGCAGCTCAACGGCCGCCAGGCCTTCATCCAGCCTGCCATGCACG 360 



Qy 361 CCGTCCTGGCCCCCGTGTTCCGCATCACGCTGCCTGTCCTGGCGCCCGAGGTAGACAGGC 420 

I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I II I I I I 

CCGTCCTGGCCCCCGTGTTCCGCATCACGCTGCCTGTCCTGGCGCCCGAGGTAGACAGGC 420 



Db 361 



Qy ^521 ACGCTCCTTGGCGGGAGCTGGAGCTTCACGACTGGATGTCCGAGGATTATGCCCACTTAA 480 

N II M I I I I I I II I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 421 ACGCTCCTTGGCGGGAGCTGGAGCTTCACGACTGGATGTCCGAGGATTATGCCCACTTAA 480 

Qy 481 AGGAGCCCTGGCTGAAGCTCACCGGCTTCCCCTGCTCCTGGACCTTCTTCCACCACCTCC 540 

l> Illlllllll Illlllllllltlllllllll Illtlll 

D^ 481 AGGAGCCCTGGCTGAAGCTCACCGGCTTCCCCTGCTCCTGGACCTTCTTCCACCACCTCC 54 0 

Qy 541 GGGAGCAGATCCGCAGCGAGTTCACCCTGCACGACCACCTTCGGCAAGAGGCCCAGGGGG 600 

I > I I I I I I I I I I I I I I I II I I I I I I I I II I II I I I I I I I I I I II I I M I I I I I I I I I I I I 

Db 541 GGGAGCAGATCCGCAGCGAGTTCACCCTGCACGACCACCTTCGGCAAGAGGCCCAGGGGG 600 



Qy 601 TACTGAGTCAGTTCCGTCTACCCCGCACAGGGGACCGCCCCAGCACCTTCGTGGGGGTCC 660 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I 

TACTGAGTCAGTTCCGTCTACCCCGCACAGGGGACCGCCCCAGCACCTTCGTGGGGGTCC 660 



Db 601 



Qy ACGTGCGCCGCGGGGACTATCTGCGTGTGATGCCCAAGCGCTGGAAGGGGGTGGTGGGTG 720 

■Illlllllll I I I I I I I I I I I I I t I I I I I I I I I I I I I I I I I I I I I I 

ACGTGCGCCGCGGGGACTATCTGCGTGTGATGCCCAAGCGCTGGAAGGGGGTGGTGGGTG 720 



Db 661 



Qy 721 ACGGCCGTTACCTCCAGCAGGCTATGGACTGGTTCCGGGCCCGATACGAAGCCCCCGTCT 780 

KIM I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 721 ACGGCGCTTACCTCCAGCAGGCTATGGACTGGTTCCGGGCCCGATACGAAGCCCCCGTCT 780 

Qy 781 TTGTGGTCACCAGCAACGGCATGGAGTGGTGCCGGAAGAACATCGACACCTCCCGGGGGG 840 

II Illlllt I Illlllllll IIIIIIMIIIIII 

Db 781 TTGTGGTCACCAGCAACGGCATGGAGTGGTGCCGGAAGAACATCGACACCTCCCGGGGGG 840 



Qy 841 ACGTGATCTTTGCTGGCGATGGGCGGGAGGCCGCGCCCGCCAGGGACTTTGCGCTGCTGG 900 

I I I N I I I I I I I I t I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I II II I I I I I I I 
ACGTGATCTTTGCTGGCGATGGGCGGGAGGCCGCGCCCGCCAGGGACTTTGCGCTGCTGG 900 



Db 841 



Qy TGCAGTGCAACCACACCATCATGACCATTGGCACCTTCGGCTTCTGGGCCGCCTACCTGG 960 

llllllll IIIMIIIIIIIIIIIIIIIIIMIIIIIIIirilllllll 

Db 901 TGCAGTGCAACCACACCATCATGACCATTGGCACCTTCGGCTTCTGGGCCGCCTACCTGG 960 

Qy 961 CTGGTGGAGATACCATCTACTTGGCTAACTTCACCCTGCCCACTTCCAGCTTCCTGAAGA 1020 

IIIIIIMIIIIIIMIMIIIIIIIIIIIIIIIMMIMIIIIIIIIIIIIIIIIIII 

961 CTGGTGGAGATACCATCTACTTGGCTAACTTCACCCTGCCCACTTCCAGCTTCCTGAAGA 1020 

Qy 1021 TCTTTAAACCCGAGGCTGCCTTCCTGCCCGAGTGGGTGGGCATTAATGCAGACTTGTCTC 1080 

llllllllllllllliMIIIIIIIIIIIIII Illllllllllll MM 

Db 1021 TCTTTAAACCCGAGGCTGCCTTCCTGCCCGAGTGGGTGGGCATTAATGCAGACTTGTCTC 1080 

Qy 1081 CACTCCAGATGTTGGCTGGGCCTTGAACCAGCCAGGAGCCTTTCTGGAATAGCCTCGGTC 1140 



http://es/ScoreAccessWeb/GetItem.action?AppId=09844268&seqId=8248 1 2&ItemName=u. . . 11/1 5/06 



. a^c 1-^ ui j:7 



Db 


1081 


Qy 


1141 


Db 


1141 


Qy 


1201 


Db 


1201 


Qy 


1261 


Db 


1261 



I I I I I I I M I M I I I I I I I I I I I I I I M I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I 

CACTCCAGATGTTGGCTGGGCCTTGAACCAGCCAGGAGCCTTTCTGGAATAGCCTCGGTC 1140 

AACCCAGGGCCAGCGTTATGGGTCTCCGGAAGCCCGAGTAACTTCCGGAGATGCTGGTGG 1200 
I I I I I I I M I I I I I i I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
AACCCAGGGCCAGCGTTATGGGTCTCCGGAAGCCCCAGTAACTTCCGGAGATGCTGGTGG 1200 

TCCTGTAGCAGGCTGGACACTTATTTCAAGAGTGATTCTAATTGGCTGGACTCAGAGGAA 1260 

N I I I I I I I I M I I I M I I I M I I I I I I I I M I I M I I I M I I I I I I I I I I I I I I I I I M 

TCCTGTAGCAGGCTGGACACTTATTTCAAGAGTGATTCTAATTGGCTGGACTCAGAGGAA 1260 



I I I I I I M I 



RESULT 7 

U70883 

LOCUS 

DEFINITION 

ACCESSION 

VERSION 

KEYWORDS 

SOURCE 

ORGANISM 



REFERENCE 
AUTHORS 



TITLE 



JOURNAL 
PUBMED 
REFERENCE 
AUTHORS 
TITLE 
JOURNAL 

REFERENCE 
AUTHORS 
TITLE 
JOURNAL 

REMARK 
COMMENT 
FEATURES 

source 



gene 
mRNA 



U70883 3315 bp DNA linear MAM 12-OCT-2005 

Sus scrofa alpha ( 1, 2 ) fucosyltransf erase (FUTl) gene, complete cds 
U70883 

U70883.2 GI:4680729 

Sus scrofa (pig) 
Sus scrofa 

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; 
Mammalia; Eutheria; Laurasiatheria; Cetartiodactyla; Suina; Suidae; 
Sus. 

1 (bases 337 to 1844) 

Meijerink,E. , Fries, R., Vogeli,P., Masabanda, J. , Wigger,G., 
Strieker, C, Neuenschwander , S . , Bertschinger , H .U. and 
Stranzinger, G. 

Two alpha (1,2) fucosyl transferase genes on porcine chromosome 6qll 
are closely linked to the blood group inhibitor (S) and Escherichia 
coli F18 receptor (ECF18R) loci 
Mamm. Genome 8 (10), 736-741 (1997) 
9321466 

2 (bases 337 to 1844) 

Meijerink,E. , Fries, R., Voegeli,?. and Stranzinger, G. 
Direct Submission 

Submitted (17-SEP-1996) Animal Science, Swiss Federal Institute of 
Technology, Tannenstrasse 1, Zurich, ZH CH-8092, Switzerland 

3 (bases 1 to 3315) 

Meijerink,E. , Fries, R., Voegeli, P. and Stranzinger , G. 
Direct Submission 

Submitted (26-APR-1999 ) Animal Science, Swiss Federal Institute of 
Technology, Tannenstrasse 1, Zurich, ZH CH-8092, Switzerland 
Sequence update by submitter 

On Apr 26, 1999 this sequence version replaced gi:2558692. 
Location/Qualifiers 
1. .3315 

/organism="Sus scrofa" 
/mol_type="genomic DNA" 
/db_xref="taxon: 9823" 
/ c hr omo s ome = " 6 " 
/map="6qll" 
<1. .3046 
/gene="FUTl" 

join(<l. .50,607. .3046) 
/gene="FUTl" 

/product="alpha(l, 2) fucosyltransf erase" 
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ov^wxvij. ocaiv.li ivcauiLd x^cuui:> ikjl rvppiioauuii \jy<j-t-t^\jo miu ocan^ii xvcauit ua-u^- 



5'UTR 

intron 

CDS 



variation 



variation 



3'UTR 



variation 



polyA_signal 



repeat_region 



ORIGIN 



join(<l. .50,607. .608) 

/gene=" FUTl" 

51. .606 

/gene="FUTl" 

609. .1706 

/gene="FUTl" 

/experiment="experimental evidence, no additional details 
recorded" 

/note="similar to human FUTl and porcine FTF genes, 
GenBank Accession Numbers M35531 and L50534 respectively" 
/codon_s tar t=l 

/product="alpha (1, 2) f ucosyltransf erase" 
/protein_id="AAB81884 il" 
/db_xref="GI : 2558 693" 

/translation="MWVPSRRHLCLTFLLVCVLAAIFFLNVYQDLFYSGLDLLALCPD 

HNWSSPVAIFCLAGTPVHPNASDSCPKHPASLSGTWTIYPDGRFGNQMGQYATLLAL 

AQLNGRQAFIQPAMHAVLAPVFRITLPVLAPEVDRHAPWRELELHDWMSEDYAHLKEP 

WLKLTGFPCSWTFFHHLREQIRSEFTLHDHLRQEAQGVLSQFRLPRTGDRPSTFVGVH 

VRRGDYLRVMPKRWKGWGDGAYLQQAMDWFRARYEAPVFWTSNGMEWCRKNIDTSR 

GDVIFAGDGREAAPARDFALLVQCNHTIMTIGTFGFWAAYLAGGDTIYLANFTLPTSS 

FLKIFKPEAAFLPEWVGINADLSPLQMLAGP" 

915 

/gene="FUTl" 

/note="observed polymorphism which changes the amino acid 

sequence at the corresponding position" 

/replace="a" 

1465 

/gene="FUTl" 

/note="observed polymorphism which changes the amino acid 

sequence at the corresponding position" 

/replace="a" 

1707. .3046 

/gene="FUTl" 

2148 

/gene="FUTl" 
/replace="t" 
3028. .3033 
/gene="FUTl" 

/inference="non-experimental evidence, no additional 

details recorded" 

3085. .3128 

/rpt_type=tandem 

/rp t_uni t_s eq= " g t " 



Query Match 98.2%; 
Best Local Similarity 99.4%; 
Matches 1261; Conservative 



Qy 

Db 

Qy 

Db 

Qy 

Db 



Score 1245.8; DB 14; Length 3315; 
Pred. No. 3.8e-308; 
0; Mismatches 7; Indels 1; 



Gaps 



1; 



1 CTCGAGCCATGTGGGTCCCCAGCCGCCGCCACCTCTGTCTGACCTTCCTGCTAGTCTGTG 60 

'I II MM IMIIIIIMIIIIIIIIIIMIIIIIIlliiiiiiii 

601 CTGCAGCCATGTGGGTCCCCAGCCGCCGCCACCTCTGTCTGACCTTCCTGCTAGTCTG1 



JTG 660 



61 



TTTTAGCAGCAATTTTCTTCCTGAACGTCTATCAAGACCTCTTTTACAGTGGCTTAGACC 120 

I I I I I I I I I I I I I illlllllllltlllllllllllllM Ill 

661 TTTTAGCAGCAATTTTCTTCCTGAACGTCTATCAAGACCTCTTTTACAGTGGCTTAGACC 720 
121 TGCTGGCCCTGTGTCCAGACCATAACGTGGTATCATCTCCCGTGGCCATATTCTGCCTGG 180 

I IIMIIMIIMIIMIIIIIIIIIII MM Mill 

721 TGCTGGCCCTGTGTCCAGACCATAACGTGGTATCATCTCCCGTGGCCATATTCTGCCTGG 780 
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ov^v^Avij, ocoicii xvcauita j^ciaiid lui nppiiv^auuii Kjya-t-t^\jo oiiu ocan^ii avc»uil Ud-u:7-o-rn.. . x a^c i / ui j:7 

Qy 181 CGGGCACGCCGGTACACCCCAACGCCTCCGATTCCTGTCCCAAGCATCCTGCCTCCTTTT 240 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I Ml 
Db 781 CGGGCACGCCGGTACACCCCAACGCCTCCGATTCCTGTCCCAAGCATCCTGCCTCCCTTT 840 

Qy 241 CCGGGACCTGGACTATTTACCCGGATGGCCGGTTTGGGAACCAGATGGGACAGTATGCCA 300 

I I M M I I M I I I I I I M I I I I M I I I I I I I I I I I M I I I I I I I I I I I I I M I I I I I I M 

Db 841 CCGGGACCTGGACTATTTACCCGGATGGCCGGTTTGGGAACCAGATGGGACAGTATGCCA 900 

Qy 301 CGCTGCTGGCCCTGGCGCAGCTCAACGGCCGCCAGGCCTTCATCCAGCCTGCCATGCACG 360 

I M I I I I I I I I I I I I I M I I M M M I I I I I M I I I I I I I I I I I I I M M I I I I I M I I I 

Db 901 CGCTGCTGGCCCTGGCGCAGCTCAACGGCCGCCAGGCCTTCATCCAGCCTGCCATGCACG 960 

Qy 361 CCGTCCTGGCCCCCGTGTTCCGCATCACGCTGCCTGTCCTGGCGCCCGAGGTAGACAGGC 420 

I I I I I M I I I I M I I I I I I I I I I I M I I I I I M I I I M I I I I I I I I I I M I I I I I I I I I I 

Db 961 CCGTCCTGGCCCCCGTGTTCCGCATCACGCTGCCTGTCCTGGCGCCCGAGGTAGACAGGC 1020 

Qy 421 ACGCTCCTTGGCGGGAGCTGGAGCTTCACGACTGGATGTCCGAGGATTATGCCCACTTAA 480 

M I I I I I I I I I I I I I I I I I I I I I M I I I M I I I I I I I I I I I I I I I I I I M I I I I I I I I I I 
Db 1021 ACGCTCCTTGGCGGGAGCTGGAGCTTCACGACTGGATGTCCGAGGATTATGCCCACTTAA 1080 

Qy 481 AGGAGCCCTGGCTGAAGCTCACCGGCTTCCCCTGCTCCTGGACCTTCTTCCACCACCTCC 540 

I I I I I I M I I I I I I I I I I I I I M I I I I I I M I I I I I I I I I I I I I I M I I I I I I I I I M I I 

Db 1081 AGGAGCCCTGGCTGAAGCTCACCGGCTTCCCCTGCTCCTGGACCTTCTTCCACCACCTCC 1140 

Qy 541 GGGAGCAGATCCGCAGCGAGTTCACCCTGCACGACCACCTTCGGCAAGAGGCCCAGGGGG 600 

I I i I I I M I I I I I I I I I I I I I I I I I I I I I I M I I I I I M I I I I I I I I I I I I 1 M I I I I I I 

Db 1141. GGGAGCAGATCCGCAGCGAGTTCACCCTGCACGACCACCTTCGGCAAGAGGCCCAGGGGG 1200 

Qy 601 TACTGAGTCAGTTCCGTCTACCCCGCACAGGGGACCGCCCCAGCACCTTCGTGGGGGTCC 660 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I f 
Db 1201 TACTGAGTCAGTTCCGTCTACCCCGCACAGGGGACCGCCCCAGCACCTTCGTGGGGGTCC 12 60 

Qy 661 ACGTGCGCCGCGGGGACTATCTGCGTGTGATGCCCAAGCGCTGGAAGGGGGTGGTGGGTG 720 

N I I I I I M I I I I I I I I I I I I I I I I I I I I I I I M I M I I I I I I I I t I I I I t I I I I I I I I I 
Db 12 61 ACGTGCGCCGCGGGGACTATCTGCGTGTGATGCCCAAGCGCTGGAAGGGGGTGGTGGGTG 1320 

Qy 721 ACGGCCGTTACCTCCAGCAGGCTATGGACTGGTTCCGGGCCCGATACGAAGCCCCCGTCT 780 

t III I I I I I I I I I I I I I I I I I I I M I I I I I 1 I I I I I I I I I I I I I I I I I I I I I t I I I j 
Db 1321 ATGGCGCTTACCTCCAGCAGGCTATGGACTGGTTCCGGGCCCGATACGAAGCCCCCGTCT 1380 

Qy 781 TTGTGGTCACCAGCAACGGCATGGAGTGGTGCCGGAAGAACATCGACACCTCCCGGGGGG 840 

I N I I I I I I U I M I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I M I I 

Db 1381 TTGTGGTCACCAGCAACGGCATGGAGTGGTGCCGGAAGAACATCGACACCTCCCGGGGGG 1440 

Qy 841 ACGTGATCTTTGCTGGCGATGGGCGGGAGGCCGCGCCCGCCAGGGACTTTGCGCTGCTGG 900 

I I I i I I M I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 1441 ACGTGATCTTTGCTGGCGATGGGCGGGAGGCCGCGCCCGCCAGGGACTTTGCGCTGCTGG 1500 

Qy 901 TGCAGTGCAACCACACCATCATGACCATTGGCACCTTCGGCTTCTGGGCCGCCTACCTGG 960 

I I I I I I I I I I I I I M M I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I M I I I 
Db 1501 TGCAGTGCAACCACACCATCATGACCATTGGCACCTTCGGCTTCTGGGCCGCCTACCTGG 1560 

Qy 961 CTGGTGGAGATACCATCTACTTGGCTAACTTCACCCTGCCCACTTCCAGCTTCCTGT^GA 1020 

I I I I M I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 1561 CTGGTGGAGATACCATCTACTTGGCTAACTTCACCCTGCCCACTTCCAGCTTCCTGAAGA 1620 

Qy 1021 TCTTTAAACCCGAGGCTGCCTTCCTGCCCGAGTGGGTGGGCATTAATGCAGACTTGTCTC 1080 

II IIIIIIIIIIMIIIIMIIIMIIIIII I I III MM 

Db 1621 TCTTTAAACCCGAGGCTGCCTTCCTGCCCGAGTGGGTGGGCATTAATGCAGACTTGTCTC 1680 

Qy 1081 CACTCCAGATGTTGGCTGGGCCTTGAACCAGCCAGGAGCCTTTCTGGAATAGCCTCGGTC 1140 
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o^v^iVLj ocon^ii ivcauita J^ctoiia nji n.yyii\^ausju u:7o-tn-£.uo aim ocan^ii xvcauit ud-u:7-on-t... 



Db 


1681 


Qy 


1141 


Db 


1741 


Qy 


1201 


Db 


1801 


Qy 


1261 


Db 


1860 



INIIIIIIIIII IIIMIIMIIIIIIIIIIIIIIIIIIlltllll 

CACTCCAGATGTTGGCTGGGCCTTGAACCAGCCAGGAGCCTTTCTGGAATAGCCTCGGTC 

AACCCAGGGCCAGCGTTATGGGTCTCCGGAAGCCCGAGTAACTTCCGGAGATGCTGGTGG 
I I I I I I i N I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 



1740 



1200 



1800 



INIIIMII IIIIIIIIIMIIIIIMIMIIIIIIllllMiiiiiiiii 



1 1 1 1 1 1 1 1 1 



RESULT 8 
AF136896 
LOCUS 

DEFINITION 

ACCESSION 

VERSION 

KEYWORDS 

SOURCE 

ORGANISM 



REFERENCE 
AUTHORS 



TITLE 



JOURNAL 
PUBMED 
REFERENCE 
AUTHORS 
TITLE 
JOURNAL 



FEATURES 

source 



gene 
CDS 



AF136896 2528 bp mRNA linear MAM 27-FEB-2001 

Sus scrofa alpha-1, 2-fucosyltransf erase (FUTl) mRNA, complete cds . 



AF136896 
AF136896.1 



GI:7328563 



Sus scrofa (pig) 
Sus scrofa 

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; 
Mammalia; Eutheria; Laurasiatheria; Cetartiodactyla; Suina; Suidae; 
Sus . 

1 (bases 1 to 2528) 

Meijerink,E. , Neuenschwander , S . , Fries, R., Dinter,A., 

Bertschinger,H.U. , Stranzinger , G. and Vogeli, P.. 

A DNA polymorphism influencing alpha (1 , 2 ) fucosyltransf erase 

activity of the pig FUTl enzyme determines susceptibility of small 

intestinal epithelium to Escherichia coli F18 adhesion 

Immunogenetics 52 (1-2), 129-136 (2000) 

11132149 

2 (bases 1 to 2528) 

Meijerink,E. , Neuenschwander , S . , Stranzinger, G . and Vogeli,p. 
Direct Submission 

Submitted (24-MAR-1999) Institute of Animal Science, Federal 
Institute of Technology, Tannenstrasse 1, Zurich, CH 8 092, 
Switzerland 

Location/Qualifiers 

1. .2528 

/organism="Sus scrofa" 
/mo l_t yp e= "mRNA" 
/db_xref="taxon: 9823" 
/ chr omo s ome= " 6 " 
/map="6qll-q21" 

/tissue_type="small intestine" 

1. .2528 

/gene="FUTl" 

92. .1189 

/gene=" FUTl" 

/codon_start=l 

/product="alpha-l, 2-fucosyltransf erase" 

/protein_id="AAF59833.1" 

/db__xref="GI : 7328564" 

/translation="MWVPSRRHLCLTFLLVCVLAAIFFLNVYQDLFYSGLDLLALCPD 
HNWSSPVAIFCLAGTPVHPNASDSCPKHPASLSGTWTIYPDGRFGNQMGQYATLLAL 
AQLNGRQAFIQPAMHAVUVPVFRITLPVLAPEVDRHAPWRELELHDWMSEDYAHLKEP 
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WLKLTGFPCSWTFFHHLREQIRSEFTLHDHLRQEAQGVLSQFRLPRTGDRPSTFVGVH 
VRRGDYLRVMPKRWKGWGDGAYLQQAMDWFRARYEAPVFWTSNGMEWCRKNIDTSR 
GDVIFAGDGREAAPARDFALLVQCNHTIMTIGTFGFWAAYLAGGDTIYLANFTLPTSS 
FLKI FKPE AAFL PEWVGINADLS PLQMLAGP " 

ORIGIN 

Query Match 98.1%; Score 1245; DB 14; Length 2528; 

Best Local Similarity 99.5%; Pred. No. 6.4e-308; 

Matches 1259; Conservative 0; Mismatches 5; Indels 1;. Gaps 1; 
Qy 5 AGCCATGTGGGTCCCCAGCCGCCGCCACCTCTGTCTGACCTTCCTGCTAGTCTGTGTTTT 64 

N I I I I I I I I I M I I I I I I I I I I I I I M I I I I I I I I I I I M I I M I I I I I I I I I I I I M I 

Db 88 AGCCATGTGGGTCCCCAGCCGCCGCCACCTCTGTCTGACCTTCCTGCTAGTCTGTGTTTT 147 

Qy 65 AGCAGCAATTTTCTTCCTGAACGTCTATCAAGACCTCTTTTACAGTGGCTTAGACCTGCT 124 

I I i I I I I M I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 148 AGCAGCAATTTTCTTCCTGAACGTCTATCAAGACCTCTTTTACAGTGGCTTAGACCTGCT 207 

Qy 125 GGCCCTGTGTCCAGACCATT^CGTGGTATCATCTCCCGTGGCCATATTCTGCCTGGCGGG 184 

N I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I 

Db 208 GGCCCTGTGTCCAGACCATAACGTGGTATCATCTCCCGTGGCCATATTCTGCCTGGCGGG 2 67 

Qy 185 CACGCCGGTACACCCCAACGCCTCCGATTCCTGTCCCAAGCATCCTGCCTCCTTTTCCGQ 244 

I I I I i I I M M I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 268 CACGCCGGTACACCCCAACGCCTCCGATTCCTGTCCCAAGCATCCTGCCTCCCTTTCCGG 327 

Qy 245 GACCTGGACTATTTACCCGGATGGCCGGTTTGGGAACCAGATGGGACAGTATGCCACGCT 304 

N I I I I I I I I I I I I I I I I I I I I I 1 I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 328 GACCTGGACTATTTACCCGGATGGCCGGTTTGGGAACCAGATGGGACAGTATGCCACGCT 387 

Qy 305 GCTGGCCCTGGCGCAGCTCAACGGCCGCCAGGCCTTCATCCAGCCTGCCATGCACGCCGT 364 

I I I I M I I I I I I I I I I I I It I I I I I I I I I I I M I I I I I I I I M I I I I I I I I I I I t M I I I 
Db 388 GCTGGCCCTGGCGCAGCTCAACGGCCGCCAGGCCTTCATCCAGCCTGCCATGCACGCCGT 447 

Qy 365 CCTGGCCCCCGTGTTCCGCATCACGCTGCCTGTCCTGGCGCCCGAGGTAGACAGGCACGC 424 

I I I I I i I I I I I M I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I M I I I I i I I I I I I I I I 

Db 448 CCTGGCCCCCGTGTTCCGCATCACGCTGCCTGTCCTGGCGCCCGAGGTAGACAGGCACGC 507 

Qy 425 TCCTTGGCGGGAGCTGGAGCTTCACGACTGGATGTCCGAGGATTATGCCCACTTAAAGGA 484 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 508 TCCTTGGCGGGAGCTGGAGCTTCACGACTGGATGTCCGAGGATTATGCCCACTTAAAGGA 567 

Qy 485 GCCCTGGCTGAAGCTCACCGGCTTCCCCTGCTCCTGGACCTTCTTCCACCACCTCCGGGA 544 

li MM III I MM Mill Mill lit II II II Mil tl 

Db 568 GCCCTGGCTGAAGCTCACCGGCTTCCCCTGCTCCTGGACCTTCTTCCACCACCTCCGGGA 627 

Qy 545 GCAGATCCGCAGCGAGTTCACCCTGCACGACCACCTTCGGCAAGAGGCCCAGGGGGTACT 604 

M I I II I II II II II t II I II II I t II I I I I I I t I t I 1 It I II I I I It I I I I II I t I I II 
Db 628 GCAGATCCGCAGCGAGTTCACCCTGCACGACCACCTTCGGCAAGAGGCCCAGGGGGTACT 687 

Qy 605 GAGTCAGTTCCGTCTACCCCGCACAGGGGACCGCCCCAGCACCTTCGTGGGGGTCCACGT 664 

MIMMIII IMIIIIIIIIIIIIIIIII IIIIIIIMIIII 

Db 688 GAGTCAGTTCCGTCTACCCCGCACAGGGGACCGCCCCAGCACCTTCGTGGGGGTCCACGT 747 

Qy 665 GCGCCGCGGGGACTATCTGCGTGTGATGCCCAAGCGCTGGAAGGGGGTGGTGGGTGACGG 724 

MMMMIIIMIIIIIIIIMMMMMI MM Ml M 

Db 748 GCGCCGCGGGGACTATCTGCGTGTGATGCCCAAGCGCTGGAAGGGGGTGGTGGGTGATGG 807 

Qy 725 CCGTTACCTCCAGCAGGCTATGGACTGGTTCCGGGCCCGATACGAAGCCCCCGTCTTTGT 784 

I MMIIIIIIIIIII Illlllllllll illlllli MUM 

Db 808 CGCTTACCTCCAGCAGGCTATGGACTGGTTCCGGGCCCGATACGAAGCCCCCGTCTTTGT 867 
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ov^wivi:, ocaiv.li xvcauiia i^cLOiia lui iT.^piiL.aiiuii v/^'o^^^uo mm ocaiL.ii xvcauii uo-u:7-o^-^... i a^c j>r 



Qy 


785 


GGTCACCAGCAACGGCATGGAGTGGTGCCGGAAGAACATCGACACCTCCCGGGGGGACGT 
1 1 1 1 1 N 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 M 1 1 1 1 M 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 
GGTCACCAGCAACGGCATGGAGTGGTGCCGGAAGAACATCGACACCTr r rr;r;r;r;rr ar r T 


o 4 4 


Db 


868 


Q O T 


Qy 


845 


GATCTTTGCTGGCGATGGGCGGGAGGCCGCGCCCGCCAGGGACTTTGCGCTGCTGGTGCA 

1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 M 1 I 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 M 1 1 1 1 1 

GAT C T T T GC TGGC GATGGGC GGGAGGCC GC GC C C GC CAGGGAr T T Tnr ar Tar TP r Trr- n 


904 


Db 


928 


0 0*7 

987 


Qy 


905 


GTGCAACCACACCATCATGACCATTGGCACCTTCGGCTTCTGGGCCGCCTACCTGGCTGG 

1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 M 

GTGC AAC CAC AC CAT C ATGAC C AT TGGC AC C T TCGGC T TC TGGGr r r;r r T a r r TP r rT n r 


964 


Db 


988 


1U4 / 


Qy 


965 


TGGAGATACCATCTACTTGGCTAACTTCACCCTGCCCACTTCCAGCTTCCTGAAGATCTT 

1 > 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 I 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 
TGGAGAT AC CATC TAG T TGGC TAAC T T CACC C T GC C C AP T TP r a r:r T T r r TP n a p 7\ T r- fp rp 


1024 


Db 


1048 


1107 


Qy 


1025 


TAAACCCGAGGCTGCCTTCCTGCCCGAGTGGGTGGGCATTAATGCAGACTTGTCTCCACT 

N 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 M 1 1 1 1 1 

TAAACC C GAGGC TGC C T T C C TGC C C GAG TGGG TGGGP a T T a a T rzp n p a p T Tr- qi r* T^r" r* 7\ r» rp 


1084 


Db 


1108 


1167 


Qy 


1085 


CCAGATGTTGGCTGGGCCTTGAACCAGCCAGGAGCCTTTCTGGAATAGCCTCGGTCAACC 
1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 i 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

CCAGATGTTGGC TGGGCC T TGAAC C AGP r af:;^ ar^PP TTTP TPP z\ 7\ T np r-r- rnr- rrir- 7\ TV /-« ^ 


1144 


Db 


1168 


1227 


Qy 


1145 


CAGGGCCAGCGTTATGGGTCTCCGGAAGCCCGAGTAACTTCCGGAGATGCTGGTGGTCCT 
1 1 > 1 1 1 1 1 1 1 1 1 i 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 i 1 1 1 1 1 1 1 1 1 1 1 1 1 

CAGGGCCAGCGTTATGGGTCTPPGGAar;pppp af::TaaPTTPPrrp7\r'7\fpr'r'rrr'r-m/-*r-mr^/-,m 


1204 


Db 


1228 


1287 


Ov 


1205 


X /\^A(jrLrU i LtCjAUAU i TAT T TCAAGAGTGAT TCTAATTGGC TGGAC T CAGAGGAAACCC 

mill f 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 i 1 1 1 1 1 1 1 1 j 1 1 1 1 1 1 1 j 

GTAGCA-GCTGGACACTTATTTCAAGAGTGATTCTAATTGGCTGGACTCAGAGGAAACCC 


1264 


Db 


1288 


1346 


Qy 


1265 


TGCAG 12 69 




Db 


1347 


1 1 1 1 1 

TGCAG 1351 





RESULT 9 
AX029029 
LOCUS 

DEFINITION 

ACCESSION 

VERSION 

KEYWORDS 

SOURCE 

ORGANISM 



REFERENCE 
AUTHORS 
TITLE 
JOURNAL 



FEATURES 

source 



AX029029 1098 bp 

Sequence 3 from Patent WO98057 68. 
AX029029 

AX029029.1 GI: 10190018 



DNA 



linear PAT 16-SEP-2000 



Sus scrofa (pig) 
Sus scrofa 

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; 
Mammalia; Eutheria; Laurasiatheria; Cetartiodactyla; Suina; Suidae; 
Sus. 
1 

Mckenzie, I. F. and Sandrin,M.S. 

Improved nucleic acids encoding a chimeric glycosyltransf erase 
Patent: WO 9805768-A 3 12-FEB-1998; 

AUSTIN RESEARCH INST (AU) ; MCKENZIE IAN FARQUHAR CAMPBELL (AU) ; 
SANDRIN MAURO SERGIO (AU) 

Location/Qualifiers 

1. .1098 

/organism="Sus scrofa" 
/mol_type="unassigned DNA" 
/db_xref ="taxon: 9823" 
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o^y^rs^ ocoiv^u xvcduiLd x^cuuis tsji j^yyii^au\jii u:7o-r-t^uo miu ocaii^ii xvcduii ua-u:7-o-tn... x" 



ORIGIN 

Query Match 86.3%; Score 1094.8; DB 2; Length 1098; 

Best Local Similarity 99.8%; Pred. No. 2.3e-269; 

Matches 1096; Conservative 0; Mismatches 2; Indels 0; Gaps 0; 



Qy 


9 


ATGTGGGTCCCCAGCCGCCGCCACCTCTGTCTGACCTTCCTGCTAGTCTGTGTTTTAGCA 


68 






1 t 1 1 1 1 1 1 1 t 1 1 1 1 1 1 1 r 1 1 I 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

> 1 > 1 1 M 1 M 1 1 1 M 1 1 M 1 1 1 1 1 1 M 1 M 1 M 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 




Db 


1 


ATGTGGGTCCCCAGCCGCCGCCACCTCTGTCTGACCTTCCTGCTAGTCTGTGTTTTAGCA 


60 


Qy 


69 


- 

GCAATTTTCTTCCTGAACGTCTATCAAGACCTCTTTTACAGTGGCTTAGACCTGCTGGCC 
1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 t 1 1 1 1 1 1 1 1 1 t 1 1 1 1 1 1 1 1 1 1 1 1 1 ) 1 ) 1 1 1 1 1 1 1 1 1 1 1 1 


128 


Db 


61 


M 1 1 M 1 1 1 M M 1 1 1 1 1 1 1 M 1 1 M 1 1 M 1 1 i 1 1 1 1 1 i 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 I 1 1 1 
GCAATTTTCTTCCTGAACGTCTATCAAGACCTCTTTTACAGTGGCTTAGACCTGCTGGCC 


120 


Qy 


129 


CTGTGTCCAGACCATAACGTGGTATCATCTCCCGTGGCCATATTCTGCCTGGCGGGCACG 


188 






1 1 1 1 1 1 1 [ 1 I 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 r 1 1 1 1 1 1 1 1 1 ) 1 1 1 1 1 1 1 1 1 1 1 1 1 1 t 

1 > 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 M 1 1 M 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 I 1 1 1 1 1 1 1 1 1 1 1 




Db 


121 


CTGTGTCCAGACCATAACGTGGTATCATCTCCCGTGGCCATATTCTGCCTGGCGGGCACG 


180 


Qy 


189 


CCGGTACACCCCAACGCCTCCGATTCCTGTCCCAAGCATCCTGCCTCCTTTTCCGGGACC 


248 






1 1 1 t 1 1 ] 1 1 1 1 1 1 1 1 1 1 1 1 ) 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 I 1 ^ , , 

1 N 1 1 i 1 1 1 M M 1 M M 1 1 1 M 1 M M M M 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 




Db 


181 


CCGGTACACCCCAACGCCTCCGATTCCTGTCCCAAGCATCCTGCCTCCTTTTCCGGGACC 


240 


Qy 


249 


TGGACTATTTACCCGGATGGCCGGTTTGGGAACCAGATGGGACAGTATGCCACGCTGCTG 


308 






IN 1 M M M M M M M 1 1 1 1 1 1 M 1 1 1 1 1 1 M 1 1 1 1 1 I 1 I 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 




Db 


241 


TGGACTATTTACCCGGATGGCCGGTTTGGGAACCAGATGGGACAGTATGCCACGCTGCTG 


300 


Qy 


309 


GCCCTGGCGCAGCTCAACGGCCGCCAGGCCTTCATCCAGCCTGCCATGCACGCCGTCCTG 


368 






1 1 1 1 I 1 i 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M M 1 1 1 1 1 1 j 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 




Db 


301 


GCCCTGGCGCAGCTCAACGGCCGCCAGGCCTTCATCCAGCCTGCCATGCACGCCGTCCTG 


360 


Qy 


369 


GCCCCCGTGTTCCGCATCACGCTGCCTGTCCTGGCGCCCGAGGTAGACAGGCACGCTCCT 


428 






1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 I 1 1 1 1 1 1 1 1 1 1 1 1 1 1 ^ , 

N M M 1 1 1 M 11 M M M M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M M 1 1 1 1 1 1 1 1 1 1 1 1 1 




Db 


361 


GCCCCCGTGTTCCGCATCACGCTGCCTGTCCTGGCGCCCGAGGTAGACAGGCACGCTCCT 


420 


Qy 


429 


TGGCGGGAGCTGGAGCTTCACGACTGGATGTCCGAGGATTATGCCCACTTAAAGGAGCCC 


488 






t 1 1 1 1 t t 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 ■ 
i ■ M It M 1 1 M 1 1 1 1 1 1 1 1 M 1 M 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 11 1 1 1 ] j 1 1 1 1 1 




Db 


421. 


TGGCGGGAGCTGGAGCTTCACGACTGGATGTCCGAGGATTATGCCCACTTAAAGGAGCCC 


480 


Qy 


489 


TGGCTGAAGCTCACCGGCTTCCCCTGCTCCTGGACCTTCTTCCACCACCTCCGGGAGCAG 


548 






1 1 1 1 1 1 1 1 1 t 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 t 1 1 1 1 1 I 1 1 1 1 1 1 1 1 1 1 1 1 1 1 I 1 1 
1 ■ M 1 1 M 1 1 1 1 1 1 1 1 t 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 M 1 1 1 M 1 1 M 1 1 1 1 1 1 1 1 1 I 1 1 1 1 




Db 


481 


TGGCTGAAGCTCACCGGCTTCCCCTGCTCCTGGACCTTCTTCCACCACCTCCGGGAGCAG 


540 


Qy 


549 


ATCCGCAGCGAGTTCACCCTGCACGACCACCTTCGGCAAGAGGCCCAGGGGGTACTGAGT 


608 






t 1 1 1 1 1 1 1 1 1 t t 1 1 t 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 I 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 I 1 
< M I 1 1 1 1 1 1 1 1 1 1 M 1 1 1 M M 1 1 1 1 1 1 M 1 1 1 1 1 1 1 M 1 1 M 1 j 1 M 1 1 1 1 1 1 1 1 1 1 1 




Db 


541 


ATCCGCAGCGAGTTCACCCTGCACGACCACCTTCGGCAAGAGGCCCAGGGGGTACTGAGT 


600 


Qy 


609 


CAGTTCCGTCTACCCCGCACAGGGGACCGCCCCAGCACCTTCGTGGGGGTCCACGTGCGC 


668 






1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 t 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 t 1 1 1 1 1 1 1 1 1 1 1 1 1 1 I 1 1 




Db 


601 


CAGTTCCGTCTACCCCGCACAGGGGACCGCCCCAGCACCTTCGTGGGGGTCCACGTGCGC 


660 


Qy 


669 


CGCGGGGACTATCTGCGTGTGATGCCCAAGCGCTGGAAGGGGGTGGTGGGTGACGGCCGT 


728. 






1 1 1 1 1 M M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 




Db 


661 


CGCGGGGACTATCTGCGTGTGATGCCCAAGCGCTGGAAGGGGGTGGTGGGTGACGGCGCT 


720 


Qy 


729 


TACCTCCAGCAGGCTATGGACTGGTTCCGGGCCCGATACGAAGCCCCCGTCTTTGTGGTC 


788 






1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 M 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 




Db 


721 


TACCTCCAGCAGGCTATGGACTGGTTCCGGGCCCGATACGAAGCCCCCGTCTTTGTGGTC 


780 


Qy 


7.8 9 ACCAGCAACGGCATGGAGTGGTGCCGGAAGAACATCGACACCTCCCGGGGGGACGTGATC 


848 






1 M 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 i 1 1 1 1 1 1 1 1 1 1 1 M 1 1 M M 




Db 


781 


ACCAGCAACGGCATGGAGTGGTGCCGGAAGAACATCGACACCTCCCGGGGGGACGTGATC 


840 
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Qy 


849 


Db 


841 


Qy 


909 


Db 


901 


Qy 


969 


Db 


961 


Qy 


1029 


Db 


1021 


Qy 


1089 


Db 


1081 



TTTGCTGGCGATGGGCGGGAGGCCGCGCCCGCCAGGGACTTTGCGCTGCTGGTGCAGTGC 908 
I I N I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I M I I I I I I I I I I I I I ,, I , 

TTTGCTGGCGATGGGCGGGAGGCCGCGCCCGCCAGGGACTTTGCGCTGCTGGTGCAGTGC 900 



AACCACACCATCATGACCATTGGCACCTTCGGCTTCTGGGCCGCCTACCTGGCTGGTGGA 

I I I I I I I I I I I I I I I I I II I M I I I I I I I I I I I I I I I I I M I I Ml M I I I I I I I I I M I 



968 



I I I I II I I II I 1.1 I I I I I I I I I II I I I I I I II I I I I II II I M II II II I I I II II I I I I 

GATACCATCTACTTGGCTAACTTCACCCTGCCCACTTCCAGCTTCCTGAAGATCTTTAAA 1020 
CCCGAGGCTGCCTTCCTGCCCGAGTGGGTGGGCATTAATGCAGACTTGTCTCCACTCCAG 1088 

N I I M II I I I I I I I I I I I I I I I I I I I I M M I I I I I I I I I I I I I I M M I I I I I I M II 

CCCGAGGCTGCCTTCCTGCCCGAGTGGGTGGGCATTAATGCAGACTTGTCTCCACTCCAG 1080 

ATGTTGGCTGGGCCTTGA 1106 
I I I I I M I I I I I I I I I II 



RESULT 10 

AC168093/C 

LOCUS 

DEFINITION 

ACCESSION 
VERSION 
KEYWORDS 
SOURCE 

ORGANISM 



AC168093 



199471 bp DNA 



linear HTG 24-JAN-2006 



WORKING DRAFT SEQUENCE, 18 unordered 



HTGS FULLTOP. 



Bos taurus clone CH240-234F5, 
pieces . 
AC168093 

AC168093. 3 GI: 85 6788 69 
HTG; HTGS_PHASE1; HTGS_DRAFT; 
Bos taurus (cattle) 
Bos taurus 

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; 
Mammalia; Eutheria; Laurasiatheria; Cetartiodactyla; Ruminantia; 
Pecora; Bovidae; Bovinae; Bos. 
REFERENCE 1 (bases 1 to 199471) 

AUTHORS Muzny,D., Adams, Agbai 11,0., Allen, C, Alsbrooks, S . , Archer,?., 
Arredondo,H., Bandaranaike, D . , Bangura,L., Beltran,B., Beltran,R., 
Beraducci,A., Biswalo,K., Blyth, P., Bonham,H., Buhay,C., Burch, P., 
Cadoree,I., Canada, A., Cardenas, V., Carter, K., Cavazos,!., 
Chacko,J., Chahrour,M., Chavez, D., Chen, A., Chen,G., Chen,R., 
Cheng,M.-T., Chu,J., Clerc,K., Cockrell,R., Coyle,M., Cree,A., 
Curry, S., Dai,W., Davila,M.L., Davis, C, Davy-Carroll, L . , De 
Anda,C., Delgado,0., Denson, S., Deramo,C., Ding,Y., Dinh,H., 
Donlin,J., McCauley,S., Dugan-Rocha, S . , Dunn, A., Durbin,K., 
Dziuda,D., Egan,A., Escotto,M., Espinosa,V., Eugene, C, Fa,M., 
Fernandez, S., Fernando, P., Flagg,N., Forbes, L., Foster, P., 
Fowler, G., Fu,Q., Fuh,E., Garcia, A., Garcia, R., Garner, T., 
Gaskin,C., Gench, S., Ghose,S., Gill,R., Gonzalez, D., 
Gonzalez-Garay,M., Guevara, W., Holder, M., Haaland,W., Haeberlen, K. , 
Hall,B., Hamid,H., Hamilton, K., Harbes,B., Harris, R., Havlak,P., 
Hawes,A., Hawkins, E., Hayes, S., Hemphill, L., Hernandez, J. , 
Hines,S., Hitchens,M., Hodgson, A., Hogues,M., Hollins,B., 
Howell, L.T., Hulyk,S., Hume, J., Imo,K., Jackson, A., Jackson, L., 
Jacob, L., Jiang, H., Johnson, B., Johnson, R., Kalafus,K., Kelly, S . , 
Keys,T., Khan,Z., King,L., Kovar,C., Kowis,A., Kowis,C., Lara,F., 
Leal, S., Lee, K., Lee,S., LeGall,F.I., Lemon, S . , Lewis, L., Li,B., 
Li,Y., Li,Z., Linnell,M., Liu,W., Liu,Y.-S., Liu,Y., Liyanage,D., 
London, P., Lopez, J., Lorensuhewa, L . , Lozado,R., Luk,T., Madu,R., 
Maheshwari,M., Mahoney,C., Malloy,K., Mansouri,D., Martinez, E., 
McClelland, H. , McPherson, J. , Mercadao,C., Metzker,M., 
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TITLE 
JOURNAL 
REFERENCE 
AUTHORS 
TITLE 
JOURNAL 



REFERENCE 
AUTHORS 
CONSRTM 
TITLE , 
JOURNAL 



COMMENT 



Milosavljevic, A. , Minja,E., Morgan, M., Morris, S., Munidasa,M., 
Murray, D., Nazarith,L., Ngo,D., Nguyen, N., Norwig-Eastaugh, E . , 
Nott,A., Nwaokelemeh,0. , Obregon,M., Ochi-Okorie, C . , Odeh,E., 
Okwuonu,G., Okwuonu,K., Parker, D., Pasternak, S . , Patel,B., 
Patel,V., Paul,H., Perez, A., Perez, L., Petrosino, J. , Pham, T., 
Primus, E., Pu,L.-L., Puazo,M., Qin,X., Quinn,A., Quiroz,J., 
Rabata,D., Rachlin,E., Reigh,R., Ren,Y., Reuter,M., Richards, S., 
Rives, C, Rodriguez, F., Rojas,A., Ruiz, S.J. , Sana,M., Sanders, W., 
Santibanez, J. , Santos, R., Savery,G., Scherer,S., Shen,H., Shen,Y., 
Sisson,I., Sneed,A., Sodergren, E . , Song, X. -2., Sorelle,R., 
Svatek,A., Taylor, E., Taylor, T., Thomas, N., Thorn, R., Thornton, R., 
Trejos,Z., Usmani,K., Vargo,C., Verduzco,D., Villasana, D . , Virk,D., 
Volkov,A., Waldron,L., Walker, B., Wang,Q., Wang,S., Warren, J., 
Wei,X., Wheeler, D., Williams, G., Williams, R., Worley,K., Wright, R., 
Wu,J., Yakub,S., Yan,K., Yuan, Y., Yu, F., Zhang, J., Zhang, L., 
Zhang, Z., Zhou, J., Weinstock,G. and Gibbs,R.A. 
Direct Submission 
Unpublished 

2 (bases 1 to 199471) 
Worley,K.C. 

Direct Submission 

Submitted { 12-SEP-2005) Human Genome Sequencing Center, Department 
of Molecular and Human Genetics, Baylor College of Medicine, One 
Baylor Plaza, Houston, TX 77030, USA 

3 (bases 1 to 199471) 

Bovine Genome Sequencing Consortium 
Direct Submission 

Submitted (24-JAN-2006) Human Genome Sequencing Center, Department 
of Molecular and Human Genetics, Baylor College of Medicine, One 
Baylor Plaza, Houston, TX 77030, USA 

On Jan 24, 2006 this sequence version replaced gi: 74474967. 
The sequence in this assembly is a combination of BAC based reads 
and whole genome shotgun sequencing reads assembled using Atlas 
(http://ww.hgsc.bcm.tmc.edu/projects/rat/). Each contig described 
in the feature table below represents a scaffold in the Atlas 
assembly (a * contig-scaf f old ' ) . Within each contig-scaf f old, 
individual sequence contigs are ordered and oriented, and separated 
by sized gaps filled with Ns to the estimated size. The sequence 
may extend beyond the ends of the clone and there may be sequence 
contigs within a contig-scaf fold that consist entirely of whole 
genome shotgun sequence reads. Both end sequences and whole genome 
shotgun sequence only contigs will be indicated in the feature 
table . 

Genome Center 

Center: Baylor College of Medicine 
Center code: BCM 

Web site: http://www.hgsc.bcm.tmc.edu/ 

Contact: hgsc-help@bcm.tmc.edu 
Project Information 

Center project name: FMZS 

Center clone name: CH240-234F5 
Summary Statistics 

Assembly program: Atlas 3.0; 

Consensus quality: 191995 bases at least Q40 

Consensus quality: 193265 bases at least Q30 

Consensus quality: 194517 bases at least Q20 

Estimated insert size: 197846; sum-of-contigs estimation 

Quality coverage: 7x in Q20 bases; sum-of-contigs estimation 



NOTE: Estimated insert size may differ from sequence length 
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(see http://www.hgsc.bcm.tmc.edu/docs/Genbank_draft_data.html) . 
NOTE: This is a 'working draft' sequence. It currently 
consists of 18 contigs . The true order of the pieces 
is not known and their order in this sequence record is 
arbitrary. Gaps between the contigs are represented as 
runs of N, but the exact sizes of the gaps are unknown. 
This record will be updated with the finished sequence 
as soon as it is available and the accession number will 



FEATURES 

source 



•k 


be preserved. 










-k 


1 


43939: 


contig 


of 


43939 bp in length 


■A- 


43940 


43989: 


gap of 


50 


bp 




* 


43990 


47481: 


contig 


of 


3492 


bp in length 




47482 


47531: 


gap of 


50 


bp 






47532 


82917: 


contig 


of 


35386 bp in length 




82918 


82967: 


gap of 


50 


bp 




* 


82968 


84638: 


contig 


of 


1671 


bp in length 




84639 


84688: 


gap of 


50 


bp 




★ 


84689 


94185: 


contig 


of 


9497 


bp in length 


* 


94186 


94235: 


gap of 


50 


bp 






94236 


106647: 


contig 


of 


12412 bp in length 


★ 


106648 


106697: 


gap of 


50 


bp 




★ 


106698 


115377: 


contig 


of 


8680 


bp in length 


★ 


115378 


115427: 


gap of 


50 


bp 




115428 


155940: 


contig 


of 


40513 bp in length 




155941 


155990: 


gap of 


50 


bp 




★ 


155991 


165469: 


contig 


of 


9479 


bp in length 


* 


165470 


165519": 


gap of 


50 


bp 




★ 


165520 


167201: 


contig 


of 


1682 


bp in length 




167202 


167251: 


gap of 


50 


bp 






167252 


171689: 


contig 


of 


4438 


bp in length 


★ 


171690 


171739: 


gap of 


50 


bp 


* 


171740 


182819: 


contig 


of 


11080 bp in length 


★ 


182820 


182869: 


gap of 


50 


bp 




* 


182870 


188368: 


contig 


of 


5499 


bp in length 


★ 


188369 


188599: 


gap of 


231 


- bp 


* 


188600 


192002: 


contig 


of 


3403 


bp in length 


* 


192003 


193346: 


gap of 


1344 bp 


★ 


193347 


194706: 


contig 


of 


1360 


bp in length 


★ 


194707 


194806: 


gap of 


unknown 


length 


* 


194807 


195984: 


contig 


of 


1178 


bp in length 


* 


195985 


196084: 


gap of 


unknown 


length 


★ 


196085 


197474: 


contig 


of 


1390 


bp in length 


★ . 


197475 


197574: 


gap of 


unknown 


length 


* 


197575 


199471: 


contig 


of 


1897 


bp in length. 




Location/Qualifiers 










1. 


.199471 











gap 



gap 



gap 



gap 



gap 



gap 



/organism="Bos taurus" 
/mol_type="genomic DNA" 
/db_xref="taxon:9913" 
/clone="CH24 0-234F5 " 
43940. .43989 
/estimated_length=50 
47482. .47531 
/estimated_length=50 
82918. .82967 
/estimated_length=50 
84639. .84688 
/estimated_length=50 
94186. .94235 
/estimated_length=50 
106648. .106697 
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ON^vyxvu. ocoii^ii ivcouiia j^cuaiia lui rvppin^aLUjii \jy<y^-t^Kjo cuiu ocaiv^ii xvcauit u»-u:7-^ 





/estimated length=50 


gap 


' 115378. .115427 




/estimated length=50 


gap 


155941. .155990 




/estimated length=50 


gap 


165470. .165519 




/estimated_length=50 


gap 


167202. .167251 




/estimated length=50 


gap 


171690. .171739 




/estimated length=50 


gap 


182820. .182869 




/estimated_length=50 


gap 


188369. .188599 




/estimated length=231 


gap 


192003. .193346 




/estimated_length=134 4 


gap 


194707. .194806 




/ estimated_length=unknown 


gap 


195985. .196084 




/estimated length^unknown 


gap 


197475. .197574 




/estimated_length=unknown 



• ORIGIN 

Query Match 68.6%; Score 871; DB12; Length 199471; 

Best Local Similarity 83.3%; Pred. No. 2.6e-212; 

Matches 1050; Conservative 0; Mismatches 205; Indels 6; Gaps 5; 

1 CTCGAGCCATGTGGGTCCCCAGCCGCCGCCACCTCTGTCTGACCTTCCTGCTAGTCTGTG 60 

II I II I I I I I I I I III III III I I I I I I I I I I I I I I I I I I I I I I I Mill 
81900 CTGCAGCCATGTGGGCTCCCGGCCACCGTCACCTCTGTCTGATCTTCCTGCTAACCTGTG 81841 

61 TTTTAGCAGCAATTTTCTTCCTGAACGTCTATCAAGACCTCTTTTACAGTGGCTTAGACC 120 
I I I I II I I I M I I I I I II I III I I I I I I I I I I I I I I I I I I I I I II 

81840 TTTTTGCCTGCGTCTTCTTCCTCCTCATCCACCAAAACCTCTTTCACAGTGGCTTAGACC 81781 

121 TGCTGGCCCTGTGTCCAGACCATAACGTGGTATCATCTCCCGTGGCCATAfTCTGCCTGG 180 

III I I I I I I I I I I II I I I I I I I I I I I I I I I 

81780 TGTTCCTGCTGTGTCCAGACCGTAGCCGGGTGAGATCCCCCGTGGCCATCCTTTGCCTGT 81721 

181 CGGGCACGCCGGTACACCCCAACGCCTCCGATTCCTGTCCCAAGCATCCTGCCTCCTTTT 240 

I I I I I I I I I I I I I I II I I I I I I I I II 

81720 CAGGCACACCGATGAACCCCAATGCCACCTTTACCTGTCCCAGGCATTCTGCTTCTGTCT 81661 

241 CCGGGACCTGGACTATTTACCCGGATGGCCGGTTTGGGAACCAGATGGGACAGTATGCCA 300 

' N II I I I I I I I I MM I I M I M I II I I I I I I I I II I I I I Mill I I I I 

81660 CAGGAACTTGGACTATCGACCCCAAAGGCCGGTTTGGGAACCAGATGGGGCAGTACGCCA 81601 
301 CGCTGCTGGCCCTGGCGCAGCTCAACGGCCGCCAGGCCTTCATCCAGCCTGCCATGCACG 360 

N I I I I I M M M I I I I II I II II I I I I M I I I I I II II M I I I I I I I II II M I I 

81600 CGCTGCTGGCCCTGGCCCAGCTCAATGGCCGCCAGGCCTTCATCCAGCCCTCCATGCACG 81541 

361 CCGTCCTGGCCCCCGTGTTCCGCATCACGCTGCCTGTCCTGGCGCCCGAGGTAGACAGGC 420 
I I I I I M I II I I I I II II II I I I I II II II I I I II II I II I I I I II II I II II I I 
81540 CCGTCCTGGCCCCCGTGTTCCGCATCACACTGCCCGTGCTGGCGCCCGAGGTGGACAGAC 81481 

421 ACGCTCCTTGGCGGGAGCTGGAGCTTCACGACTGGATGTCCGAGGATTATGCCCACTTAA 480 
' II M II M I M I I I I II I I II M I II I I II I I I I I II I I M II II I II I I I II I 
81480 ACGCTCCTTGGCAGGAGCTGGAGCTTCATGACTGGATGTCGGAGGAGTACGCCCACTTGA 81421 



Qy 

Db 

Qy 

Db 

Qy 

Db 

Qy 

Db 

Qy 

Db 

Qy 

Db 

Qy 

Db 

Qy 

Db 
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o^v^ivj-; ocaiL.li Avcauita a^cloii^ lui ^frpiiuauuii w:7o*t-r^uo oiiu ocan^ii xvcauiL ua-u:7-o-r-t... x agc j:7 



Qy 

Db 


481 
81420 


AGGAGCCCTGGCTGAAGCTCACCGGCTTCCCCTGCTCCTGGACCTTCTTCCACCACCTCC 
1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 
AGGAGCCCTGGCTGAAGCTCACGGGCTTCCCCTGCTCCTGGACCTTCTTCCATCACCTCC 


540 
81361 


Qy 


541 


GGGAGCAGATCCGCAGCGAGTTCACCCTGCACGACCACCTTCGGCAAGAGGCCCAGGGGG 
UN 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II M Mill 1 1 1 1 1 III 1 
GGGACCAGATCCGCAGCGAGTTCACCCTACACGAGCATCTGCGGCAGGAGGCGCAGCGTT 


600 


Db 


81360 


81301 


Qy 


601 


TACTGAGTCAGTTCCGTCTACCCCGCACAGGGGACCGCCCCAGCACCTTCGTGGGGGTCC 

> N 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 Mil 

CACTGAGTGGGCTCCGCTTCCCCCGCACGGGAGGCCGCCCGAGCACCTTTGTGGGTGTCC 


660 


Db 


81300 


81241 


Qy 


661 


ACGTGCGCCGCGGGGACTATCTGCGTGTGATGCCCAAGCGCTGGAAGGGGGTGGTGGGTG 

1 1 1 1 1 M 1 II 1 1 1 1 II II 1 L 1 1 1 M II 1 II 1 1 M 1 1 1 II 1 1 1 1 

ATGTGCGCCGCGGGGACTACCTACAGGTGATGCCCCTCCACTGGAAGGGTGTGGTGGGGG 


720 


Db 


81240 


81181 


Qy 

Db 


721 
81180 


ACGGCCGTTACCTCCAGCAGGCTATGGACTGGTTCCGGGCCCGATACGAAGCCCCCGTCT 
1 II 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 II M 1 M 1 1 1 1 1 1 1 II II 1 M 1 II III 
ATCGCGCTTACCTCCAACAGGCTATGGACTGGTTCCGGGCCCGGCACAAAGCCCCCATCT 


780 
81121 


Qy 


781 


TTGTGGTCACCAGCAACGGCATGGAGTGGTGCCGGAAGAACATCGACACCTCCCGGGGGG 

1 1 1 1 1 1 M 1 1 M M 1 1 M 1 1 M 1 1 1 1 1 1 II M 1 1 1 1 1 1 1 1 1 1 t 1 1 1 M 1 1 M 1 1 1 

TTGTGGTCACTAGCAACGGCATGAAGTGGTGCCGGGAAAACATTGACACCTCCCGGGGGG 


840 


Db 


81120 


81061 


Qy 


841 


ACGTGATCTTTGCTGGCGATGGGCGGGAGGCCGCGCCCGCCAGGGACTTTGCGCTGCTGG 
1 1 1 1 1 1 1 i t 1 1 -1 1 II 1 1 1 1 1 1 1 1 1 1 M 1 II II 1 1 1 M II 1 1 1 I 1 1 1 
ATGTGATCTTCGCCGGAGACGGGCAGGAGGGTGCGCCCAACAAAGACTTTGCGCTGCTCA 


900 


Db 


81060 


81001 


Qy 

Db 


901 
81000 


TGCAGTGCAACCACACCATCATGACCATTGGCACCTTCGGCTTCTGGGCCGCCTACCTGG 
II 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 I 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 
CACAATGCAACCACACCATCATGACCATTGGCACCTTTGGCTTCTGGGCCGCCTACCTGG 


960 
80941 


Qy 


961 


CTGGTGGAGATACCATCTACTTGGCTAACTTCACCCTGCCCACTTCCAGCTTCCTGAAGA 

Ill MIIIMII MM IIIIIIIIIIIMI MM IIIIMI 

CTGGTGGAGACACCATCTACCTGGCCAACTTCACCCTGCCTGATTCCAGCTTCCTGAAGA 


1020 


Db 


80940 


80881 


Qy 


1021 


TCTTTAAACCCGAGGCTGCCTTCCTGCCCGAGTGGGTGGGCATTAATGCAGACTTGTCTC 

Mlllll IIIIIIM Mllll Mllll MM 

TCTTTAAACCTGAGGCTGCCTTCCTGCCCGAGTGGGTGGGCATTAATGCAGACTTGTCTC 


1080 


Db 


80880 


80821 


Qy 


1081 


CACTCCAGATGTTGGCTGGGCCTTGAACCAGCCAGGAGCCTTTCTGGAATAGCCTCGGTC 

1 MM 1 II II II 1 1 II 1 1 1 1 1 M 1 II 1 1 II 

CGCTCCA-ATGAAGGCTGAGTCTTG-AGGAACAGGGAGACTTTCTGAAATAGCCT-GATC 


1140 


Db 


80820 


80764 


Qy 


1141 


AACCCAGGGCCAGCGTTATGGGTCTCCGGAAGCCCGAGTAACTTCCGGAGATGCTGGTGG 

M MIIMI MM MM IMIIIil II Mllll 

CACGTAGGGCCAATAGTATGCATCTCTGGAAGCCC— AGAAGATTCTAAAAAGCTTGTGG 


1200 


Db 


80763 


80706 


Qy 


1201 


TCCTGTAGCAGGCTGGACACTTATTTCAAGAGTGATTCTAATTGGCTGGACTCAGAGGAA 

II II It II 1 II [ 1 1 1 1 1 1 1 1 1 I 1 1 1 1 1 1 1 1 r 1 1 1 1 Ill II 

If II II II 1 1 II 1 1 1 1 1 1 1 Mill 1 1 II 1 1 Mllll 1 II 1 II 1 M 

TCTTGCAGGTGG-TCGACACCCATTTCTAGAGTCATTCTAGTTGGCTAGACTTGGAGAAA 


1260 


Db 


80705 


80647 


Qy 


1261 


A 1261 




Db 


80646 


1 

A 80646 





RESULT 11 
AF394920S2 

LOCUS AF394920S2 1867 bp mRNA linear MAM 23-SEP-2001 

DEFINITION Bos taurus alpha (1-2) fucosyltransf erase 1 (futl) gene, exon 2 and 
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ov^wiNo:, ocoioii xvcauiLd A^rciona iwi rvppin^auwii v/:7o-rnz.uo oiiu ocan^ii ivcauit ua-v^-o-t-r,,. ra^c ^/ 



ACCESSION 

VERSION 

KEYWORDS 

SEGMENT 

SOURCE 

ORGANISM 



REFERENCE 
AUTHORS 



TITLE 



JOURNAL 
REFERENCE 
AUTHORS 

TITLE 
JOURNAL 



FEATURES 

source 



gene 
exon 

CDS 



ORIGIN 



complete cds . 
AF394921 

AF394 921. 1 GI: 15723257 
2 of 2 

Bos taurus (cattle) 
Bos taurus 

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; 
Mammalia; Eutheria; Laurasiatheria; Cetartiodactyla; Riominantia; 
Pecora; Bovidae; Bovinae; Bos. 

1 (bases 1 to 1867) 

Saunier,K., Barreaud, J. P. , Eggen,A., Oriol,R., Leveziel,H., 
Julien,R. and Petit, J. -M. 

Organization of the Bovine alpha2-Fucosyltransf erase Gene Cluster 
Suggests that the Seel Gene Might Have Been Shaped Through a 
Nonautonomous Ll-Retrotransposition Event Within the Same Locus 
Mol. Biol. Evol. 18 (11) (2001) In press 

2 (bases 1 to 1867) 

Saunier,K., Barreaud, J. P . , Eggen,A., Oriol,R., Leveziel,H.> 
Julien,R. and Petit, J. -M. 
Direct Submission 

Submitted (26- JUN-2001) Unite de Genetique Moleculaire Animale, 
UR1061, Faculte des Sciences, 123, Avenue Albert Thomas, Limoges 
87060, France 

Location/Qualifiers 

1. .1867 

/organism="Bos taurus" 
/mol_type="mRNA" 
/db_xref ="taxon: 9913" 
/ c hr omo s ome ="18" 
/map="18q24" 

join(AF394920.1:l. .263,1. .1867) 

/gene="futl" 

1. .1867 

/gene="futl" 

/number=2 

6. .1088 

/gene="futl" 

/codon_start=l 

/product="alpha(l-2) fucosyltransf erase 1" 
/protein_id="AAL06321 .1" 
/db_xref="GI: 1572 3258" 

/translation="MWAPGHRHLCLIFLLTCVFACVFFLLIHQNLFHSGLDLFLPCPD 
RXRVRS PVAI LC LSGTLMNPNAT FTC PRHS AS VS GTWT I D PKGRFGNQMGQ YAT LVTVL 
AQLNGRQAFIQPSMHAVLAPVFRITLPVLAPEVDRHAPWQELELHDWMSEEYAHLKEP 
WLKLTGFPCSWTFFHHLRDQIRSEFTLHEHLRQEAQRSLSGLRFPRTGGRPSTFVGVH 
VRRGDYLQVMPLHWKGWGDRAYLQQAMDWFRARHKAPIFWTSNGMKWCRENIDTSR 
GDVIFAGDGQEGAPNKDFALLTQCNHTIMTIGTFGFWAAYLAGGDTIYLANFTLPDSS 
FLKI FKPE AAFL PEWVGINADLS PLQ " 



Query Match 68.1%; Score 864.4; DB14; Length 1867; 

Best Local Similarity 83.1%; Pred. No. 2.5e-210; 

Matches 1044; Conservative 0; Mismatches 207; Indels 6; 



Gaps 



5; 



Qy 

Db 

Qy 



5 AGCCATGTGGGTCCCCAGCCGCCGCCACCTCTGTCTGACCTTCCTGCTAGTCTGTGTTTT 64 

NIMIIIIM III III III lllllllllllll Mllllllll MM 

2 AGCCATGTGGGCTCCCGGCCACCGTCACCTCTGTCTGATCTTCCTGCTAACCTGTGTTTT 61 

65 AGCAGCAATTTTCTTCCTGAACGTCTATCAAGACCTCTTTTACAGTGGCTTAGACCTGCT 124 
I IIIN I II I III IMIIIII III! Mill I 
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o^^v/ivjj. ocoiv^u ivcauita i^cioiia lui r^yyn^auuii u^/o-r-t^uo oiiu ocan^u ±vcauii ua-v^-o-t-r... x a^c 

62 TGCCTGCGTCTTCTTCCTCCTCATCCACCAAAACCTCTTTCACAGTGGCTTAGACCTGTT 121 

Qy 125 GGCCCTGTGTCCAGACCATAACGTGGTATCATCTCCCGTGGCCATATTCTGCCTGGCGGG 184 

I I I M I M I I I I I I I III II I I I I I II I I I II I I I II I I I I I 
122 CCTGCCGTGTCCAGACCGTANCCGGGTGAGATCCCCCGTGGCCATCCTTTGCCTGTCAGG 181 

Qy 185 CACGCCGGTACACCCCAACGCCTCCGATTCCTGTCCCAAGCATCCTGCCTCCTTTTCCGG 244 

I I I I I I I I I II I I I M I I I I M I II I I I II I I I II I I I I I I M 
182 CACACTGATGAACCCCAATGCCACCTTTACCTGTCCCAGGCATTCTGCTTCTGTCTCAGG 241 

Qy 245 GACCTGGACTATTTACCCGGATGGCCGGTTTGGGAACCAGATGGGACAGTATGCCACGCT 304 

II I I I I M I I I I I I I I I I I I I I I I I II I i I I I I I I I II 11 I I I I I I I 11 I I 
Db 242 AACTTGGACTATCGACCCCAAAGGCCGGTTTGGGAACCAGATGGGGCAGTACGCCACGCT 301 

Qy 305 GCTGGCCCTGGCGCAGCTCAACGGCCGCCAGGCCTTCATCCAGCCTGCCATGCACGCCGT 364 

I I I I M I I I I I II I I M II M M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

302 GGTGGCCCTGGCCCAGCTCAATGGCCGCCAGGCCTTCATCCAGCCCTCCATGCACGCCGT 361 

Qy 365 CCTGGCCCCCGTGTTCCGCATCACGCTGCCTGTCCTGGCGCCCGAGGTAGACAGGCACGC 424 

I I I N I I I M I M I I I I I I II I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I 
362 CCTGGCCCCCGTGTTCCGCATCACACTGCCCGTGCTGGCGCCCGAGGTGGACAGACACGC 421 

Qy ^25 TCCTTGGCGGGAGCTGGAGCTTCACGACTGGATGTCCGAGGATTATGCCCACTTAAAGGA 484 

I N I I I I I I II I I I I I I I I I I I I I I I I I II I I I I Mill II I I I I M I I I I I I I 
Db 422 TCCTTGGCAGGAGCTGGAGCTTCATGACTGGATGTCGGAGGAGTACGCCCACTTGAAGGA 481 

Qy 485 GCCCTGGCTGAAGCTCACCGGCTTCCCCTGCTCCTGGACCTTCTTCCACCACCTCCGGGA 544 

I I I M I I I I I I I I I I I I I I I II I M I I I I I I I I I I I I t I I i I I I I I I I I I I I I I I I I I 
Db 482 GCCCTGGCTGAAGCTCACGGGCTTCCCCTGCTCCTGGACCTTCTTCCATCACCTCCGGGA 541 

Qy 545 GCAGATCCGCAGCGAGTTCACCCTGCACGACCACCTTCGGCAAGAGGCCCAGGGGGTACT 604 

llllllilllltlllllllllll Mill II II Mill MM Ill 

Db 542 CCAGATCCGCAGCGAGTTCACCCTACACGAGCATCTGCGGCAGGAGGCGCAGCGTTCACT 601 

Qy 605 GAGTCAGTTCCGTCTACCCCGCACAGGGGACCGCCCCAGCACCTTCGTGGGGGTCCACGT 664 

UN I II II I II t M I I I I I I II II II I M I I II I II I II M I I I I I 

Db 602 GAGTGGGCTCCGCTTCCCCCGCACGGGAGGCCGCCCGAGCACCTTTGTGGGTGTCCATGT 661 

Qy 665 GCGCCGCGGGGACTATCTGCGTGTGATGCCCAAGCGCTGGAAGGGGGTGGTGGGTGACGG 724 

I I I I I I i I I I I M M I I I 1 II II I II I I II I I I I I II I I I 

Db 662 GCGCCGCGGGGACTACCTACAGGTGATGCCCCTCCACTGGAAGGGTGTGGTGGGGGATCG 721 

Qy 725 CCGTTACCTCCAGCAGGCTATGGACTGGTTCCGGGCCCGATACGAAGCCCCCGTCTTTGT 784 

I Ml I I I II I I M I I I II I Mill II II I M I II I 

Db 722 CGCTTACCTCCAACAGGCTATGGACTGGTTCCGGGCCCGGCACAAAGCCCCCATCTTTGT 781 

Qy 785 GGTCACCAGCAACGGCATGGAGTGGTGCCGGAAGAACATCGACACCTCCCGGGGGGACGT 844 

llllli il INI illM 1 I Mill I I I II I I I M M I I II I II 

Db 782 GGTCACTAGCAACGGCATGAAGTGGTGCCGGGAAAACATTGACACCTCCCGGGGGGATGT 841 



Qy 845 GATCTTTGCTGGCGATGGGCGGGAGGCCGCGCCCGCCAGGGACTTTGCGCTGCTGGTGCA 904 

Ill Mill llllll II IIIMIIIMIIII M 

GATCTTCGCCGGAGACGGGCAGGAGGGTGCGCCCAACAAAGACTTTGCGCTGCTCACACA 901 



Db 842 



Qy 905 GTGCAACCACACCATCATGACCATTGGCACCTTCGGCTTCTGGGCCGCCTACCTGGCTGG 964 

MMMMMMMMIMMMI llllllllllllllll Mill 

^b 902 ATGCAACCACACCATCATGACCATTGGCACCTTTGGCTTCTGGGCCGCCTACCTGGCTGG 961 

Qy 965 TGGAGATACCATCTACTTGGCTAACTTCACCCTGCCCACTTCCAGCTTCCTGAAGATCTT 1024 

llllll IIIIMIII MM II Mill IIMIIIIMI Mill 

Db 962 TGGAGACACCATCTACCTGGCCAACTTCACCCTGCCTGATTCCAGCTTCCTGAAGATCTT 1021 
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ov^v^xvi^ ocaiL.li xvcauiia iycioiia lui r^ppiivauuu u^^o-rn^uo aiiu ocaiL.u Avcauit ud-uj7-on-r. 



Ov 


1 0? s 


nh 




Ov 


X U O >J 


nh 

LJiJ 




wy 


X i. fi o 


Db 


1139 


Qy 


1205 


Db 


1197 



TAAACCCGAGGCTGCCTTCCTGCCCGAGTGGGTGGGCATTAATGCAGACTTGTCTCCACT 

NUN M I I I I I I I I I I I I I I M I I I I I I I I I I I I I I i I I I I I I I I I I I I I I M I II 

TAAACCTGAGGCTGCCTTCCTGCCCGAGTGGGTGGGCATTAATGCAGACTTGTCTCCGCT 



1084 



1081 



1144 



CCAGATGTTGGCTGGGCCTTGAACCAGCCAGGAGCCTTTCTGGAATAGCCTCGGTCAACC 

N I I M I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I 

CCA-ATGAAGGCTGAGTCTTG-AGGAACAGGGAGACTTTCTGAAATAGCCT-GATCCACG 1138 



I I I I 



I I I I I I I I I I I M I I I 



II II I I II I I I I I I I I 



I I I I I I I I I I I I II I I I I I I I I I I I I I M I II I I II II 
GCAGGTGG-TCGACACCCATTTCTAGAGTCATTCTAGTTGTCTAGACTTGGAGAAAA 1252 



I I 



RESULT 12 

AF186465 

LOCUS 

DEFINITION 

ACCESSION 
VERSION 
KEYWORDS 
SOURCE 

ORGANISM 



REFERENCE 
AUTHORS 

TITLE 



JOURNAL 
PUBMED 
REFERENCE 
AUTHORS 
TITLE 
JOURNAL 



FEATURES 

source 



gene 
itiRNA 

CDS 



AF186465 1083 bp DNA linear MAM 18-OCT-2000 

Bos taurus alpha (1-2) fucosyltransf erase 1 (futl) gene, complete 
cds . 

AF186465 

AF186465.1 GI:6409299 

Bos taurus (cattle) 
Bos taurus 

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; 
Mammalia; Eutheria; Laurasiatheria; Cetartiodactyla; Ruminantia; 
Pecora; Bovidae; Bovinae; Bos. 

1 (bases 1 to 1083) 

Barreaud, J.P., Saunier,K., Souchaire, J. , Delourme^D., Oulmouden, A. , 
Oriol,R., Leveziel^H., Julien,R. and Petit, J. M. 

Three bovine alpha2-fucosyltransf erase genes encode enzymes that 
preferentially transfer fucose on Galbetal-3GalNAc acceptor 
substrates 

Glycdbiology 10 (6), 611-621 (2000) 
10814703 

2 (bases 1 to 1083) 

Saunier,K., Giraud-Delville, C . and Furet,J.P. 
Direct Submission 

Submitted (14-SEP-1999) Departement de Genetique Animale, INRA, 
Laobratoire de Genetique Biochimique et de Cytogenetique, Jouy en 
Josas 78352, France 

Location/Qualifiers 

1. .1083 

/organism="Bos taurus" 
/mol_type=" genomic DNA" 
/db_xref="taxon: 9913" 
/ c h r omo s ome= "18" 
1. .1083 
/gene="futl" 
<1. .>1083 
/gene="futl" 

/product="alpha(l-2) fucosyltransf erase 1" 

1. .1083 

/gene="futl" 

/codon_start=l 

/product="alpha(l-2) fucosyltransf erase 1" 
/pr o tein_id= "AAF07 933 . 1 " 
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/db_xref ="GI : 6409300" 

/translation="MWAPGHRHLCLIFLLTCVFACVFFLLIHQNLFHSGLDLFLPCPD 

RXRVRSPVAILCLSGTLMNPNATFTCPRHSASVSGTWTIDPKGRFGNQMGQYATLVAL 

AQLNGRQAFIQPSMHAVIAPVFRITLPVIJVPEVDRHAPWQELELHDVinyLSEEYAHLra 

WLKLTGFPCSWTFFHHLRDQIRSEFTLHEHLRQEAQRSLSGLRFPRTGGRPSTFVGVH 

VRRGDYLQVMPLHWKGWGDRAYLQQAMDWFRARHKAPIFWTSNGMKWCRENIDTSR 

GDVI FAGDGQEGAPNKD FALLTQCNHT IMT IGT FGFWAAYLAGGDT I YLANFTL PDS S 

FLKI FKPE AAFL PE WVG I NAD LS P L Q " 

ORIGIN 



Query Match 64.9%; Score 823.6; DB 14; Length 1083; 

Best Local Similarity 85.2%; Pred. No. 7.8e-200; 

Matches 919; Conservative 0; Mismatches 160; Indels 0; Gaps 0; 



Qy 


9 


ATGTGGGTCCCCAGCCGCCGCCACCTCTGTCTGACCTTCCTGCTAGTCTGTGTTTTAGCA 

1 1 1 1 1 1 1 III III III 1 1 1 1 1 1 t 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 r 1 1 1 1 1 1 > 1 II 
1 > 1 II II III III III 1 1 1 1 1 II II 1 1 1 1 i II 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 M 

ATGTGGGCTCCCGGCCACCGTCACCTCTGTCTGATCTTCCTGCTAACCTGTGTTTTTGCC 


68 


Db 


1 


60 


Qy 


69 


GCAATTTTCTTCCTGAACGTCTATCAAGACCTCTTTTACAGTGGCTTAGACCTGCTGGCC 

1 1 1 t 1 1 [ 1 1 1 II 1 III ] 1 1 1 1 1 r 1 1 1 1 1 1 1 1 1 1 1 

> I M 1 M 1 1 1 II 1 Ml II j 1 1 M 1 1 1 1 1 II 1 II 1 1 1 1 1 1 1 1 1 
TGCGTCTTCTTCCTCCTCATCCACCAAAACCTCTTTCACAGTGGCTTAGACCTGTTCCTG 


128 


Db 


61 


120 


Qy 


129 


CTGTGTCCAGACCATAACGTGGTATCATCTCCCGTGGCCATATTCTGCCTGGCGGGCACG 
1 1 1 1 1 1 1 1 t 1 1 1 II 1 III III 1 1 1 1 t I 1 1 1 1 1 1 1 1 I 1 1 1 1 

I 1 1 1 1 1 1 1 1 1 1 1 II 1 III III 1 1 1 1 1 1 M 1 II 1 1 1 1 1 II 1 1 1 1 1 1 

CCGTGTCCAGACCGTANCCGGGTGAGATCCCCCGTGGCCATCCTTTGCCTGTCAGGCACA 


188 


Db 


121 


180 


Qy 


189 


CCGGTACACCCCAACGCCTCCGATTCCTGTCCCAAGCATCCTGCCTCCTTTTCCGGGACC 

1 1 1 1 1 1 1 1 1 1 III II 1 1 1 1 i 1 1 1 1 1 1 1 1 1 1 1 1 1 t 1 II 

> > 1 1 1 1 1 1 1 1 III II 1 1 1 II 1 1 1 1 1 1 1 1 1 1 II 1 II 1 II II II 

CTGATGAACCCCAATGCCACCTTTACCTGTCCCAGGCATTCTGCTTCTGTCTCAGGAACT 


248 


Db 


181 


240 


Qy 


249 


TGGACTATTTACCCGGATGGCCGGTTTGGGAACCAGATGGGACAGTATGCCACGCTGCTG 

1 1 II 1 1 1 t 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 t 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 11 
f > 1 > ■ ■ 1 1 II 1 1 1 1 I 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 II II 

TGGACTATCGACCCCAAAGGCCGGTTTGGGAACCAGATGGGGCAGTACGCCACGCTGGTG 


308 


Db 


241 


300 


Qy 


309 


GCCCTGGCGCAGCTCAACGGCCGCCAGGCCTTCATCCAGCCTGCCATGCACGCCGTCCTG 

1 1 1 1 1 t 1 1 1 1 1 1 1 1 1 1 1 1 1 1 It 1 1 1 1 1 1 It II M 1 1 1 1 1 1 1 1 t 1 1 1 1 1 1 1 1 1 1 1 1 1 
' 1 ■ ' ■ 1 ■ I 1 1 1 II 1 1 1 1 1 1 1 1 t 1 1 1 1 1 1 1 M M 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 II II 1 1 

GCCCTGGCCCAGCTCAATGGCCGCCAGGCCTTCATCCAGCCCTCCATGCACGCCGTCCTG 


368 


Db 


301 


360 


Qy 

Db 


369 
361 


GCCCCCGTGTTCCGCATCACGCTGCCTGTCCTGGCGCCCGAGGTAGACAGGCACGCTCCT 

1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II i 1 1 1 1 1 1 1 1 1 II It t M 1 1 1 1 1 t 1 1 1 1 1 
I 1 I I I 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 [ 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

GCCCCCGTGTTCCGCATCACACTGCCCGTGCTGGCGCCCGAGGTGGACAGACACGCTCCT 


428 
420 


Qy 

Db 


429 
421 


TGGCGGGAGCTGGAGCTTCACGACTGGATGTCCGAGGATTATGCCCACTTAAAGGAGCCC 
1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 II M 1 1 1 1 1 II 1 1 M 1 II 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 
TGGCAGGAGCTGGAGCTTCATGACTGGATGTCGGAGGAGTACGCCCACTTGAAGGAGCCC 


488 
480 


Qy 


489 


TGGCTGAAGCTCACCGGCTTCCCCTGCTCCTGGACCTTCTTCCACCACCTCCGGGAGCAG 

1 1 1 1 1 1 M M 1 1 M 1 1 1 1 1 1 1 1 1 1 1 II M 1 1 II 1 1 M 1 1 1 II 1 1 1 1 1 1 1 1 II 1 1 III 

TGGCTGAAGCTCACGGGCTTCCCCTGCTCCTGGACCTTCTTCCATCACCTCCGGGACCAG 


548 


Db 


481 


540 


Qy 

Db 


549 
541 


ATCCGCAGCGAGTTCACCCTGCACGACCACCTTCGGCAAGAGGCCCAGGGGGTACTGAGT 

Ml III Ill Mill Ml 1 II Mill 

ATCCGCAGCGAGTTCACCCTACACGAGCATCTGCGGCAGGAGGCGCAGCGTTCACTGAGT 


608 
600 


Qy 


609 


CAGTTCCGTCTACCCCGCACAGGGGACCGCCCCAGCACCTTCGTGGGGGTCCACGTGCGC 
1 MM 1 M 1 1 II 1 1 II 1 II 1 1 II II 1 M 1 1 1 1 1 1 II 1 II 1 1 II 1 II 1 
GGGCTCCGCTTCCCCCGCACGGGAGGCCGCCCGAGCACCTTTGTGGGTGTCCATGTGCGC 


668 


Db 


601 


660 


Qy 

Db 


669 
661 


CGCGGGGACTATCTGCGTGTGATGCCCAAGCGCTGGAAGGGGGTGGTGGGTGACGGCCGT 

1 1 1 1 1 1 1 II 1 1 II 1 1 1 1 1 III 1 1 1 1 1 1 1 II 1 

CGCGGGGACTACCTACAGGTGATGCCCCTCCACTGGAAGGGTGTGGTGGGGGATCGCGCT 


728 
720 
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Qy 


729 


Db 


721 


Qy 


789 


Db 


781 


Qy 


849 


Db 


841 


Qv 


909 


Db 


901 


Ov 


969 


Db 


961 


Qy 


1029 


Db 


1021 



TACCTCCAGCAGGCTATGGACTGGTTCCGGGCCCGATACGAAGCCCCCGTCTTTGTGGTC 
I N I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 1 II II I I I I I I I II I M I I I I I 
TACCTCCAACAGGCTATGGACTGGTTCCGGGCCCGGCACAAAGCCCCCATCTTTGTGGTC 



II I I I I I I I I I I II I I I II I I II I I I I I II I I I I I II I I I I M I I I I I I II I I I 
ACTAGCAACGGCATGAAGTGGTGCCGGGAAAACATTGACACCTCCCGGGGGGATGTGATC 

TTTGCTGGCGATGGGCGGGAGGCCGCGCCCGCCAGGGACTTTGCGCTGCTGGTGCAGTGC 
II I I I I I I II I I I II I I I I I I I I I I I II 



AACCACACCATCATGACCATTGGCACCTTCGGCTTCTGGGCCGCCTACCTGGCTGGTGGA 

I I I II I I I t II I I I I I I I I I M II I I I I I IIIIMIIIIIIIIIIIIIMIIII 

AACCACACCATCATGACCATTGGCACCTTTGGCTTCTGGGCCGCCTACCTGGCTGGTGGA 

GATACCATCTACTTGGCTAACTTCACCCTGCCCACTTCCAGCTTCCTGAAGATCTTTAAA 

II I I I I II I II I I I I I I I M I I I I I I I I I I I II I II I I II I I I I I I II I I I I I I 
GACACCATCTACCTGGCCAACTTCACCCTGCCTGATTCCAGCTTCCTGAAGATCTTTAAA 



788 



780 



848 



840 



968 



960 



1028 



1020 



CCCGAGGCTGCCTTCCTGCCCGAGTGGGTGGGCATTAATGCAGACTTGTCTCCACTCCA 1087 
II I I I I I I I I I I II I I I I I I M I I I II I M I I M I I I I I I I I I I I I I I I I I I I I I I I 
CCTGAGGCTGCCTTCCTGCCCGAGTGGGTGGGCATTAATGCAGACTTGTCTCCGCTCCA 1079 



RESULT 13 

AR075308 

LOCUS 

DEFINITION 

ACCESSION 

VERSION 

KEYWORDS 

SOURCE 

ORGANISM 

REFERENCE 
AUTHORS 
TITLE 



JOURNAL 
FEATURES 

source 



ORIGIN 



AR075308 8174 bp DNA 

Sequence 5 from patent US 5955347. 
AR075308 

AR075308.1 GI: 10002058 



linear PAT 28-AUG-2000 



Unknown . 

Unknown. 

Unclassified. 

1 (bases 1 to 8174) 

Lowe , J . B . 

Methods and products for the synthesis of oligosaccharide 
structures on glycoproteins, glycolipids, or as free molecules, and 
for the isolation of cloned genetic sequences that determine these 
structures 

Patent: US 5955347-A 5 21-SEP-1999; 
Location/Qualifiers 
1. .8174 

/organism="unknown" 
/mol_type="unassigned DNA" 



Query Match 63.7%; 
Best Local Similarity 79.6%; 
Matches 993; Conservative 



Score 808.6; DB 2; 
Pred. No. 4.1e-196; 
0; Mismatches 249; 



Length 8174; 



Indels 



Qy 



Db 



Qy 



Db 



5; Gaps 



3; 



1 CTCGAGCCATGTGGGTCCCCAGCCGCCGCCACCTCTGTCTGACCTTCCTGCTAGTCTGTG 60 

II MIIIIIMI Ml II I I IIIM II 

4678 CTGCAGCCATGTGGCTCCGGAGCCATCGTCAGCTCTGCCTGGCCTTCCTGCTAGTCTGTG 4737 

61 TTTTAGCAGCAATTTTCTTCCTGAACGTCTATCAAGACCTCTTTTACAGTGGCTTAGACC 120 

I I I I IN IIIIIMI I II II I MM MM III II 

4738 TCCTCTCTGTAATCTTCTTCCTCCATATCCATCAAGACAGCTTTCCACATGGCCTAGGCC 4797 
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o^KJiy^ occu^ii xvc:>uita i^ctoii^ lui /^ppiii^auuii u^o^^^^o oiiu ocaii.ii xvcauit u:>-u^-onn... r a^c ui 



Qy 121 TGCTGGCCCTGTGTCCAGACCATAACGTGGTATCATCTCCCGTGGCCATATTCTGCCTGG 180 

l< I III Ill i MM M I II llllllll IIMIIIII 

TGTCGATCCTGTGTCCAGACCGCCGCCTGGTGACACCCCCAGTGGCCATCTTCTGCCTGC 4857 



Db 4798 



Qy 181 CGGGCACGCCGGTACACCCCAACGCCTCCGATTCCTGTCCCAAGCATCCTGCCTCCTTTT 240 

M M I I III I M I I I II I 11 I I M I I I M 11 I II I I I I I I 

Db 4858 CGGGTACTGCGATGGGCCCCAACGCCTCCTCTTCCTGTCCCCAGCACCCTGCTTCCCTCT 4 917 



Qy 241 CCGGGACCTGGACTATTTACCCGGATGGCCGGTTTGGGAACCAGATGGGACAGTATGCCA 300 

MM MMIIIII I INI IIMIIIII II 11 Mllllllllll 

CCGGCACCTGGACTGTCTACCCCAATGGCCGGTTTGGTAATCAGATGGGACAGTATGCCA 4 977 



Db 4 918 



Qy 301 CGCTGCTGGCCCTGGCGCAGCTCAACGGCCGCCAGGCCTTCATCCAGCCTGCCATGCACG 360 

"Mllllll Mill II Illilll IIMM MM IIMIIII 

CGCTGCTGGCTCTGGCCCAGCTCAACGGCCGCCGGGCCTTTATCCTGCCTGCCATGCATG 5037 



Db 4978 



Qy 361 CCGTCCTGGCCCCCGTGTTCCGCATCACGCTGCCTGTCCTGGCGCCCGAGGTAGACAGGC 420 

"1 MMIIIII II M II M I M II II II I M Mill II II || II I I I I 

CCGCCCTGGCCCCGGTATTCCGCATCACCCTGCCCGTGCTGGCCCCAGAAGTGGACAGCC 5097 



Db 5038 



Qy 421 ACGCTCCTTGGCGGGAGCTGGAGCTTCACGACTGGATGTCCGAGGATTATGCCCACTTAA 480 
I M I I II II II II II II II II Mill M II II II I 

GCACGCCGTGGCGGGAGCTGCAGCTTCACGACTGGATGTCGGAGGAGTACGCGGACTTGA 5157 



Db 5098 



Qy 481 AGGAGCCCTGGCTGAAGCTCACCGGCTTCCCCTGCTCCTGGACCTTCTTCCACCACCTCC 540 

'I M I I II I II II I I II II I II II I M M II II I II II II II II 

GAGATCCTTTCCTGAAGCTCTCTGGCTTCCCCTGCTCTTGGACTTTCTTCCACCATCTCC 5217 



Db 5158 



Qy 541 GGGAGCAGATCCGCAGCGAGTTCACCCTGCACGACCACCTTCGGCAAGAGGCCCAGGGGG 600 

MM II II II II II I II II II M II II II t I II t II M II I I II II II I III I I 

Db 5218 GGGAACAGATCCGCAGAGAGTTCACCCTGCACGACCACCTTCGGGAAGAGGCGCAGAGTG 5277 



Qy 601 TACTGAGTCAGTTCCGTCTACCCCGCACAGGGGACCGCCCCAGCACCTTCGTGGGGGTCC 660 

IIMIIIII MMMMMMMMM M IIMM 

TGCTGGGTCAGCTCCGCCTGGGCCGCACAGGGGACCGCCCGCGCACCTTTGTCGGCGTCC 



Db 5278 



5337 



Qy 661 ACGTGCGCCGCGGGGACTATCTGCGTGTGATGCCCAAGCGCTGGAAGGGGGTGGTGGGTG 720 

"MMIMI I I II Mill MMIMIMIM MIIMII I 

ACGTGCGCCGTGGGGACTATCTGCAGGTTATGCCTCAGCGCTGGAAGGGTGTGGTGGGCG 



Db 5338 



5397 



Qy 721 ACGGCCGTTACCTCCAGCAGGCTATGGACTGGTTCCGGGCCCGATACGAAGCCCCCGTCT 780 

M M IMIMI IMIII IIIIIIIIIIMIIIM II lllllllllllll I 

ACAGCGCCTACCTCCGGCAGGCCATGGACTGGTTCCGGGCACGGCACGAAGCCCCCGTTT 5457 



Db 5398 



Qy 781 TTGTGGTCACCAGCAACGGCATGGAGTGGTGCCGGAAGAACATCGACACCTCCCGGGGGG 840 

I MIIMIIIItlllllllllllllMIM I lltlllllllllMII III I 

TCGTGGTCACCAGCAACGGCATGGAGTGGTGTAAAGAAAACATCGACACCTCCCAGGGCG 5517 



Db 5458 



Qy 841 ACGTGATCTTTGCTGGCGATGGGCGGGAGGCCGCGCCCGCCAGGGACTTTGCGCTGCTGG 900 

I M II M M II I II II II I I II M II III I lllllllllllll 

Db 5518 ATGTGACGTTTGCTGGCGATGGACAGGAGGCTACACCGTGGAAAGACTTTGCCCTGCTCA 5577 



Qy 901 TGCAGTGCAACCACACCATCATGACCATTGGCACCTTCGGCTTCTGGGCCGCCTACCTGG 960 

M M II II II II II M I II II II II II II II M II I I II I I II II I II II II II II 

CACAGTGCAACCACACCATTATGACCATTGGCACCTTCGGCTTCTGGGCTGCCTACCTGG 5637 



Db 5578 



Qy CTGGTGGAGATACCATCTACTTGGCTAACTTCACCCTGCCCACTTCCAGCTTCCTGAAGA 1020 

M M II I II II II II I M II M II M II II II II II M II II M II 

Db 5638 CTGGCGGAGACACTGTCTACCTGGCCAACTTCACCCTGCCAGACTCTGAGTTCCTGAAGA 5697 

Qy 1021 TCTTTAAACCCGAGGCTGCCTTCCTGCCCGAGTGGGTGGGCATTAATGCAGACTTGTCTC 1080 
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^^^^y^ ocoiuii xvcauiL:> i^cioiis lui rvppin^auwii \jyo^^z^KJo aiiu ocaiL/ii xvcauii ua-w:7-oT^n. 





ooyo 


Ov 


1 Oft 1 


nh 


O / O 0 




114 1 


Db 


5816 


Qy 


1201 


Db 


5875 



I I I I M I M Mill I I I II I I I I II I II II I I I M II I I 1! li I i I I I I I I I I I I I I 



I II I 



I I I 



I I I 



I I I I I I I 



I I 



I I 



I III 



I I I I I I I I I I I I I II I II I I I I I I I I I I I I M I I I I I I I I 



TCCTGTAGCAG — GCTGGACACTTATTTCAAGAGTGATTCTAATTGG 124 5 
I I III III I I I I II II M I II I I I I I I I 

TGTTCCTGAAGCAAATGGGTGCCCGTATCCAGAGTGATTCTAGTTGG 5921 



RESULT 14 

134189 

LOCUS 

DEFINITION 

ACCESSION 

VERSION 

KEYWORDS 

SOURCE 

ORGANISM 

REFERENCE 
AUTHORS 
TITLE 



JOURNAL 
FEATURES 

source 



ORIGIN 



linear PAT 06-FEB-1997 



134189 8174 bp DNA 

Sequence 5 from patent US 5595900. 
134189 

134189.1 GI:1824980 



Unknown . 

Unknown . 

Unclassified. 

1 (bases 1 to 8174) 

Lowe, J.B. 

Methods and products for the synthesis of oligosaccharide 
structures on glycoproteins, glycolipids, or as free molecules, and 
for the isolation of cloned genetic sequences that determine these 
structures 

Patent: US 5595900-A 5 21-JAN-1997; 
Location/Qualifiers 
1. .8174 

/organism="unknown" 
/mol_type="unassigned DNA" 



Query Match 63.7%; 
Best Local Similarity 79.6%; 
Matches 993; Cons erva tive 



Score 808.6; DB 2; 
Pred. No. 4.1e-196; 
0; Mismatches 249; 



Length 8174; 



Indels 



5 ; Gaps 



3; 



Qy 


1 


Db 


4678 


Qy 


61 


Db 


4738 


Qy 


121 


Db 


- 4798 


Qy 


181 


Db 


4858 


Qy 


241 



CTCGAGCCATGTGGGTCCCCAGCCGCCGCCACCTCTGTCTGACCTTCCTGCTAGTCTGTG 60 
II I I I I I I I I I I III I I I I II II II I II III I I I II I I II II I M II I I 



TTTTAGCAGCAATTTTCTTCCTGAACGTCTATCAAGACCTCTTTTACAGTGGCTTAGACC 120 
I i I 1 I I I I I I I I I II I II I I I I I I 11 I I I I I I I I I I I I I 



TGCTGGCCCTGTGTCCAGACCATAACGTGGTATCATCTCCCGTGGCCATATTCTGCCTGG 180 

ill I I I I I I II I 

TGTCGATCCTGTGTCCAGACCGCCGCCTGGTGACACCCCCAGTGGCCATCTTCTGCCTGC 4857 

CGGGCACGCCGGTACACCCCAACGCCTCCGATTCCTGTCCCAAGCATCCTGCCTCCTTTT 240 
I I I I N III I I I I I I I I I I M I II I I I I I I I I I I M I I I I I II I I I 



CCGGGACCTGGACTATTTACCCGGATGGCCGGTTTGGGAACCAGATGGGACAGTATGCCA 300 
I I I I I I 11 ii I I I I I I I II I I I I I M I II I I I II I I I I I I I I I I II I I I I I I i 
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^^KJXKu. ocoicii xvcauita i^cioiia iwi rvyyu^aiiyjn v/^o-t-r^uo oiiu ocajiv.ii xvcauit ua-v^^-o-r-t... rage j-t ua 

Db 4918 CCGGCACCTGGACTGTCTACCCCAATGGCCGGTTTGGTAATCAGATGGGACAGTATGCCA 4977 

Qy 301 CGCTGCTGGCCCTGGCGCAGCTCAACGGCCGCCAGGCCTTCATCCAGCCTGCCATGCACG 360 

I I I I M I I M I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I i I I I I I I I I I I I I 
Db 4978 CGCTGCTGGCTCTGGCCCAGCTCAACGGCCGCCGGGCCTTTATCCTGCCTGCCATGCATG 5037 

Qy 361 CCGTCCTGGCCCCCGTGTTCCGCATCACGCTGCCTGTCCTGGCGCCCGAGGTAGACAGGC 420 

Ml I I I I M I I I II I I II I I I M I I Mill II II I I I II II II I II II I 

Db 5038 CCGCCCTGGCCCCGGTATTCCGCATCACCCTGCCCGTGCTGGCCCCAGAAGTGGACAGCC 5097 

Qy 421 ACGCTCCTTGGCGGGAGCTGGAGCTTCACGACTGGATGTCCGAGGATTATGCCCACTTAA 480 

I I II I I I II I I I I I I I I II I M I I I II I I M I I I I Mill II II MM I 

Db 5098 GCACGCCGTGGCGGGAGCTGCAGCTTCACGACTGGATGTCGGAGGAGTACGCGGACTTGA 5157 

Qy 481 AGGAGCCCTGGCTGAAGCTCACCGGCTTCCCCTGCTCCTGGACCTTCTTCCACCACCTCC 540 

I I I I i M I I II I I I I I II M M I II II I I I I I II M I I I II M I I II II 

Db 5158 GAGATCCTTTCCTGAAGCTCTCTGGCTTCCCCTGCTCTTGGACTTTCTTCCACCATCTCC 5217 

Qy 541 GGGAGCAGATCCGCAGCGAGTTCACCCTGCACGACCACCTTCGGCAAGAGGCCCAGGGGG 600 

MM II II M M I I I II II I II II II M II M II II M I II I I I I M II Ml I I 

Db 5218 GGGAACAGATCCGCAGAGAGTTCACCCTGCACGACCACCTTCGGGAAGAGGCGCAGAGTG 5277 

Qy 601 TACTGAGTCAGTTCCGTCTACCCCGCACAGGGGACCGCCCCAGCACCTTCGTGGGGGTCC 660 

I I I I I I I I I I I I I M M II II II II I M I II M II II II I II II II M 

Db 5278 TGCTGGGTCAGCTCCGCCTGGGCCGCACAGGGGACCGCCCGCGCACCTTTGTCGGCGTCC 5337 

Qy 661 ACGTGCGCCGCGGGGACTATCTGCGTGTGATGCCCAAGCGCTGGAAGGGGGTGGTGGGTG 720 

I I I II II II I I I! 1 II M M II I II II I I I II II I II I II I M I M I 

Db 5338 ACGTGCGCCGTGGGGACTATCTGCAGGTTATGCCTCAGCGCTGGAAGGGTGTGGTGGGCG 5397 

Qy 721 ACGGCCGTTACCTCCAGCAGGCTATGGACTGGTTCCGGGCCCGATACGAAGCCCCCGTCT 780 

II II M I II I I I II I M II II I II II I I M II II II I M II II I M I II I 

Db 5398 ACAGCGCCTACCTCCGGCAGGCCATGGACTGGTTCCGGGCACGGCACGAAGCCCCCGTTT 5457 

Qy 781 TTGTGGTCACCAGCAACGGCATGGAGTGGTGCCGGAAGAACATCGACACCTCCCGGGGGG 840 

MMIMIM Mill IIMIIIIII III I 

Db 5458 TCGTGGTCACCAGCAACGGCATGGAGTGGTGTAAAGAAAACATCGACACCTCCCAGGGCG 5517 

Qy 841 ACGTGATCTTTGCTGGCGATGGGCGGGAGGCCGCGCCCGCCAGGGACTTTGCGCTGCTGG 900 

I I I I I M I I II I II II II I I II I II I III I II II I M I II I M 

Db 5518 ATGTGACGTTTGCTGGCGATGGACAGGAGGCTACACCGTGGAAAGACTTTGCCCTGCTCA 5577 

Qy 901 TGCAGTGCAACCACACCATCATGACCATTGGCACCTTCGGCTTCTGGGCCGCCTACCTGG 960 

IINNIIIIIIMIII MMMMMMMMMMIIMIMM III 

Db 5578 CACAGTGCAACCACACCATTATGACCATTGGCACCTTCGGCTTCTGGGCTGCCTACCTGG 5637 

Qy 961 CTGGTGGAGATACCATCTACTTGGCTAACTTCACCCTGCCCACTTCCAGCTTCCTGAAGA 1020 

I I I I I I I 11 II M II I II II II II II II II II M II II II I I II I I 

Db 5638 CTGGCGGAGACACTGTCTACCTGGCCAACTTCACCCTGCCAGACTCTGAGTTCCTGAAGA 5697 

Qy 1021 TCTTTAAACCCGAGGCTGCCTTCCTGCCCGAGTGGGTGGGCATTAATGCAGACTTGTCTC 1080 

I I I I I M 11 Mill II M II II I M II 1! II M II M I II I II II II I II II I II II 

Db 5698 TCTTTAAGCCGGAGGCGGCCTTCCTGCCCGAGTGGGTGGGCATTAATGCAGACTTGTCTC 5757 

Qy 1081 CACTCCAGATGTTGGCTGGGCCTTGAACCAGCCAGGAGCCTTTCTGGAATAGCCTCGGTC 1140 

.Mill II MUM II II I M II II II I II III I III 

Db 5758 CACTCTGGACATTGGCTAAGCCTTGA— GAGCCAGGGAGACTTTCTGAAGTAGCCTGATC 5815 

Qy 1141 AACCCAGGGCCAGCGTTATGGGTCTCCGGAAGCCCGAGTAACTTCCGGAGATGCTGGTGG 1200 

I I I I I I I I I I I I I I I I M I M I I I II II II II I II I II II 
Db 5816 TTTCTAGAGCCAGCAGTACGTGGCTTCAGAGGCCTG-GCATCTTCTGGAGAAGCTTGTGG 5874 
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Qy 1201 TCCTGTAGCAG— GCTGGACACTTATTTCAAGAGTGATTCTAATTGG 1245 

I I III III I I I I M II I I I I I I I I I I I I 

Db 5875 TGTTCCTGAAGCAAATGGGTGCCCGTATCCAGAGTGATTCTAGTTGG 5921 



RESULT 15 

AY923051 

LOCUS 

DEFINITION 

ACCESSION 
VERSION 
KEYWORDS 
SOURCE 

ORGANISM 



AY923051 8549 bp DNA linear 

Homo sapiens fucosyltransf erase 1 (galactoside 
2-alpha-L-fucosyltransf erase) (FUTl) gene, complete cds 
AY923051 

AY923051.1 GI:58802462 



PRI 14-FEB-2005 



REFERENCE 
AUTHORS 

TITLE 
JOURNAL 

COMMENT 



FEATURES 

source 



Homo sapiens (human) 
Homo sapiens 

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; 
Mammalia; Eutheria; Euarchontoglires; Primates; Catarrhini; 
Hominidae ; Homo . . 
1 (bases 1 to 8549) 

Rieder,M.J., Johanson, E . J. , da Ponte^S.H., Hastings, N. C . , 
Ahearn,M.O., Bertucci, C . B . , Wong,M.W., Yi,Q. and Nickerson, D. A. 
Direct Submission 

Submitted ( 07-FEB-2005) Genome Sciences, University of Washington, 
1705 NE Pacific, Seattle, WA 98195, USA 

To cite this work please use: SeattleSNPs . NHLBI HL66682 Program 
for Genomic Applications, UW-FHCRC, Seattle, WA (URL: 
http://pga.gs.washington.edu) . 

Location/Qualifiers 
1. .8549 
/organism="Homo sapiens" 
/mol_type="genomic DNA" 
/db_xref-"taxon: 9606" 
repeat_region 1. .387 

/rpt_family="ERVK" 
/rpt_type=dispersed 
388. .687 
/rpt_family="Alu" 
/rpt_type=dispersed 
605 

/frequency="0 .04" 
/replace="t" 
688. .1070 
/rpt_family="ERVK" 
/rp t_t ype=di sper s ed 
706 

/ f requency=" 0.07" 
/replace="c" 
1260 

/f requency="0 . 07 " 
/replace="a" 
1384 

/f requency="0 . 09 " 
/replace="g" 



repeat_region 



variation 



repeat_region 



variation 



variation 



variation 



variation 



variation 



variation 



1386 

/f requency=" 0.01" 

/replace="c" 

1395 

/f requency=" 0 . 02 " 

/replace="a" 

1459 
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variation 

gene 
mRNA 

variation 

variation 

variation 

misc_feature 
repeat_region 
repeat_region 
variation 

repeat_region 

misc_feature 

variation 

CDS 



variation 



/frequency="0 . 42" 

/replace="g" 

1617 

/f requency="0 . 04 " 
/replace="c" 
2000. .6583 
/gene="FUTl" 

join{2000. .2100,3307. .6583) 
/gene="FUTl" 

/product="fucosyltransferase 1 (galactoside 

2-alpha-L-fucosyltransf erase) " 

2021 

/gene="FUTl" 

/f requency="0 . 01" 

/replace="a" 

2125 

/gene="FUTl" 

/f requency="0 . 01" 

/replace="c" 

2319 

/gene="FUTl" 

/ f requency=" 0.04" 

/replace="t" 

2404. .2632 

/gene="FUTl" 

/note="Region not scanned for variation" 

2452. .2619 

/rpt_family="Alu" 

/ rpt_type=dispersed 

2620. .2920 

/ r p t_f ami 1 y = " Al u " 

/rpt_type=dispersed 

2892 

/gene="FUTl" 
/frequency="0 . 38" 
/replace="a" 
2921. .3059 
/rpt_fainily=="Alu" 
/rpt_type=dispersed 
3026. .3262 
/gene="FUTl" 

/note="Region not scanned for variation" 
3274 

/gene="FUTl" 

/f requency="0 . 36" 

/replace="a" 

3309. .4406 

/gene="FUTl" 

/codon_start=l 

/product="fucosyltransferase 1 (galactoside 
2-alpha-L-fucosyltransf erase) " 
/protein_id="AAW82437 .1" 
/db_xref="GI : 58802463" 

/translation="MWLRSHRQLCLAFLLVCVLSVIFFLHIHQDSFPHGLGLSILCPD 

RRLVTPPVAI FC LPGTAMGPNASS SC PQH PASLSGTWTVYPNGRFGNQMGQYAT LLAL 

AQLNGRRAFILPAMHAAIAPVFRITLPVLAPEVDSRTPWRELQLHDWMSEEYADLRDP 

FLKLSGFPCSWTFFHHLREQIRREFTLHDHLREEAQSVLGQLRLGRTGDRPRTFVGVH 

VRRGDYLQVMPQRWKGWGDSAYLRQAMDWFRARHEAPVFWTSNGMEWCKENIDTSQ 

GDVTFAGDGQEATPWKDFALLTQCNHTIMTIGTFGFWAAYLAGGDTVYLANFTLPDSE 

FLKI FKPE AAFL PEWVG INAD LS PLWTLAKP " 

3343 
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ov^WAvu* oc^oicii ivcduild j^cloii:) rv^piii^auwu \jyo^^^\jo oiiu ocaiv^ii xvcduit u»-v/:7-o^^. 



variation 



variation 



variation 



variation 



repeat_region 



variation 



variation 



repeat_region 



variation 



variation 



variation 



variation 



variation 



repeat_region 



variation 



variation 



/gene="FUTl" 
/frequency="0.01" 
/replace="t" 
4458 

/gene="FUTl" 
/frequency="0.36" 
/replace="a" 
4586 

/gene="FUTl" 
/frequency="0.32" 
/replace="c" 
4796 

/gene="FUTl" 

/f requency="0 . 11" 

/replace="a" 

4858 

/gene="FUTl" 

/f requency="0 . 05" 

/replace="a" 

4949. .5256 

/rpt_family="Alu" 

/rpt_type=:dispersed 

5149 

/gene="FUTl" 

/f requency="0 . 32 " 

/replace="a" 

5168 

/gene="FUTl" 
/frequency="0 .25" 
/replace="g" 
5261. .5583 
/rpt_f amily= "Alu" 
/ rp t_type=di sper s ed 
5263 

/gene="FUTl" 
/frequency="0 . 01" 
/replace="a" . 
5271 

/gene="FUTl" 

/f requency="0. 07 " 

/replace="a" 

5272 

/gene="FUTl" 

/ f r equency=" 0.47" 

/replace="a" 

5273 

/gene="FUTl" 

/ f r equency=" 0.47" 

/replace=" " 

5298. .5301 

/gene="FUTl" 

/frequency="0 . 68" 

/replace=" " 

5584. .5724 

/rpt_family="Alu" 

/rpt_type=dispersed 

5614 

/gene="FUTl" 

/f requency=" 0.05" 

/replace="a" 

5700 
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/gene="FUTl" 

/ f requency=" 0.32" 

/replace="c" 
variation 6096 

/gene="FUTl" 

/f requency="0 . 27" 

/replace="t" 
repeat_region 6706. .7013 

/rpt_faitiily="Alu" 

/ rp t_t ype=di spe r s ed 
variation 7179 

/ f requency="0 . 02" 

/replace="a" 
variation 7194 

/frequency="0 . 37" 

/replace="c" 
variation 7218 

/f requency="0. 29" 

/replace="t" 
variation 7251. .7257 

/frequency="0 . 11" 

/replace="" 
variation 7313 

/frequency="0 . 38" 

/replace="c" 

Query Match 63.7%; Score 808.6; DB 5; Length 8549; 

Best Local Similarity 79.6%; Pred. No. 4.1e-196; 

Matches 993; Conservative 0; Mismatches 249; Indels 5; Gaps 3; 



Qy 


1 


CTCGAGCCATGTGGGTCCCCAGCCGCCGCCACCTCTGTCTGACCTTCCTGCTAGTCTGTG 

N Ml INI MM M 1 1 1 1 I 1 1 M 1 M 1 1 1 t 1 1 1 1 1 1 I 1 1 I 1 1 

CTGCAGCCATGTGGCTCCGGAGCCATCGTCAGCTCTGCCTGGCCTTCCTGCTAGTCTGTG 


60 


Db 


3301 


3360 


Qy 


61 


TTTTAGCAGCAATTTTCTTCCTGAACGTCTATCAAGACCTCTTTTACAGTGGCTTAGACC 
1 > 1 1 M 1 1 1 1 t 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 t 1 1 I 1 
TCCTCTCTGTAATCTTCTTCCTCCATATCCATCAAGACAGCTTTCCACATGGCCTAGGCC 


120 


Db 


3361 


3420 


Qy 


121 


TGCTGGCCCTGTGTCCAGACCATAACGTGGTATCATCTCCCGTGGCCATATTCTGCCTGG 

Ml II MM II Mill II 1 II 1 MM MM III 

TGTCGATCCTGTGTCCAGACCGCCGCCTGGTGACACCCCCAGTGGCCATCTTCTGCCTGC 


180 


Db 


3421 


3480 


Qy 


181 


CGGGCACGCCGGTACACCCCAACGCCTCCGATTCCTGTCCCAAGCATCCTGCCTCCTTTT 

MM M II 1 MMIMMMM MM 1 MM Mill III 1 1 

CGGGTACTGCGATGGGCCCCAACGCCTCCTCTTCCTGTCCCCAGCACCCTGCTTCCCTCT 


240 


Db 


3481 


3540 


Qy 


241 


CCGGGACCTGGACTATTTACCCGGATGGCCGGTTTGGGAACCAGATGGGACAGTATGCCA 

MM MMIIIII 1 1 IIIMIIII II 1 1 II M 1 II 1 1 1 11 1 II 1 1 

CCGGCACCTGGACTGTCTACCCCAATGGCCGGTTTGGTAATCAGATGGGACAGTATGCCA 


300 


Db 


3541 


3600 


Qy 


301 


CGCTGCTGGCCCTGGCGCAGCTCAACGGCCGCCAGGCCTTCATCCAGCCTGCCATGCACG 

Ml Mill IIIIIIIIIIIIIIM MUM MM IIMMIMIM 1 

CGCTGCTGGCTCTGGCCCAGCTCAACGGCCGCCGGGCCTTTATCCTGCCTGCCATGCATG 


360 


Db 


3601 


3660 


Qy 


361 


CCGTCCTGGCCCCCGTGTTCCGCATCACGCTGCCTGTCCTGGCGCCCGAGGTAGACAGGC 

Mini 1 1 1 Ml M II II Mill 1 

CCGCCCTGGCCCCGGTATTCCGCATCACCCTGCCCGTGCTGGCCCCAGAAGTGGACAGCC 


420 


Db 


3661 


3720 


Qy 


421 


ACGCTCCTTGGCGGGAGCTGGAGCTTCACGACTGGATGTCCGAGGATTATGCCCACTTAA 

1 1 M MMIIII MMM MUM Mill II M MM 1 

GCACGCCGTGGCGGGAGCTGCAGCTTCACGACTGGATGTCGGAGGAGTACGCGGACTTGA 


480 


Db 


3721 


3780 
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Ljv^vyx>j^ ocoic/ii ivcauiia i^^iaiis> lui r^yyii^auuit \jyo^-t^Kjo oiiu ocaiv^n xvcauit ua-u:7-o^-r... ra^c j:7 ui j:7 



AGGAGCCCTGGCTGAAGCTCACCGGCTTCCCCTGCTCCTGGACCTTCTTCCACCACCTCC 540 

N II I I I I I I I I M I . I I I II II II I II II I I II I I I I I I I I I I II MM 

GAGATCCTTTCCTGAAGCTCTCTGGCTTCCCCTGCTCTTGGACTTTCTTCCACCATCTCC 3840 
GGGAGCAGATCCGCAGCGAGTTCACCCTGCACGACCACCTTCGGCAAGAGGCCCAGGGGG 600 

MM II I M I I M II II I M II I I II II II I M I M I I M II II I II II III I I 

GGGAACAGATCCGCAGAGAGTTCACCCTGCACGACCACCTTCGGGAAGAGGCGCAGAGTG 3900 
TACTGAGTCAGTTCCGTCTACCCCGCACAGGGGACCGCCCCAGCACCTTCGTGGGGGTCC 660 

I Ml Mill II I I II I I I II I II I II I M II I I II II II I II II MM 

TGCTGGGTCAGCTCCGCCTGGGCCGCACAGGGGACCGCCCGCGCACCTTTGTCGGCGTCC 3960 

ACGTGCGCCGCGGGGACTATCTGCGTGTGATGCCCAAGCGCTGGAAGGGGGTGGTGGGTG 720 

I II M I I I II I II I I M I I M II I I- II I I I I I II II II I I II I II II II II I 
ACGTGCGCCGTGGGGACTATCTGCAGGTTATGCCTCAGCGCTGGAAGGGTGTGGTGGGCG 4020 

ACGGCCGTTACCTCCAGCAGGCTATGGACTGGTTCCGGGCCCGATACGAAGCCCCCGTCT 780 

II M I II II I I II I II I II I I I II I M I II I II I II I I I II I I II I I I I I 

ACAGCGCCTACCTCCGGCAGGCCATGGACTGGTTCCGGGCACGGCACGAAGCCCCCGTTT 4080 

TTGTGGTCACCAGCAACGGCATGGAGTGGTGCCGGAAGAACATCGACACCTCCCGGGGGG 840 
I I I 11 I I I I I II II II I I II I II II I I II I I I I II II II II II I I II III I 

TCGTGGTCACCAGCAACGGCATGGAGTGGTGTAAAGAAAACATCGACACCTCCCAGGGCG 4140 

ACGTGATCTTTGCTGGCGATGGGCGGGAGGCCGCGCCCGCCAGGGACTTTGCGCTGCTGG 900 
i I I M II I II M II II II I I I I I II I III I I M I II II I II I I 

ATGTGACGTTTGCTGGCGATGGACAGGAGGCTACACCGTGGAAAGACTTTGCCCTGCTCA 4200 

TGCAGTGCAACCACACCATCATGACCATTGGCACCTTCGGCTTCTGGGCCGCCTACCTGG 9 60 

M I I I II II II II II II I II I M II II II I I I II II I II II II II I I I t It I II II 
CACAGTGCAACCACACCATTATGACCATTGGCACCTTCGGCTTCTGGGCTGCCTACCTGG 4260 

CTGGTGGAGATACCATCTACTTGGCTAACTTCACCCTGCCCACTTCCAGCTTCCTGAAGA 1020 

M M I I M I I I II II I I II I II I II I I II M II I M II II II I II I 

CTGGCGGAGACACTGTCTACCTGGCCAACTTCACCCTGCCAGACTCTGAGTTCCTGAAGA 4320 

TCTTTAAACCCGAGGCTGCCTTCCTGCCCGAGTGGGTGGGCATTAATGCAGACTTGTCTC 1080 
M M I II II I II I I II I II II II II I II I II I I I II I II I I II II II II II II I I II 
TCTTTAAGCCGGAGGCGGCCTTCCTGCCCGAGTGGGTGGGCATTAATGCAGACTTGTCTC 4380 

CACTCCAGATGTTGGCTGGGCCTTGAACCAGCCAGGAGCCTTTCTGGAATAGCCTCGGTC 1140 
Mill II II II II M I II II I I I I I II II Ml I III 

CACTCTGGACATTGGCTAAGCCTTGA— GAGCCAGGGAGACTTTCTGAAGTAGCCTGATC 4438 

AACCCAGGGCCAGCGTTATGGGTCTCCGGAAGCCCGAGTAACTTCCGGAGATGCTGGTGG 1200 

I M I I I M I I I I I II I I I I I I I I I M M II I II I I I II II 
TTTCTAGAGCCAGCAGTACGTGGCTTCAGAGGCCTG-GCATCTTCTGGAGAAGCTTGTGG 4 4 97 

TCCTGTAGCAG— GCTGGACACTTATTTCAAGAGTGATTCTAATTGG 124 5 
I I III I I I I I I I II II II I II I II I II I 

TGTTCCTGAAGCAAATGGGTGCCCGTATCCAGAGTGATTCTAGTTGG 4544 
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Qy 
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4201 


Qy 
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Db 


4261 


Qy 


1021 
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4321 


Qy 


1081 
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4381 


Qy 
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4439 
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1201 


Db 


4498 
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ov^vyxvi^ ocaiv.li xvcauits i^ciaiia lui r^ppiii^auuii v:.onn^uo oiiu ocaioii xvcauit ua-u^-on^-^... r a^c i ux 



SCORE Search Results Details for Application 
09844268 and Search Result us-09-844-268- 
12-rng. 



Score Home Retrieve Application SCORE System SCORE Comments / 

^ Overview FAQ Suggestions 

VJf ^""^^ ^""^ ^^^^^^ '^^^"'^^ "^^^^^ ^^"^ Application 09844268 and Search Result us-09- 
844-268-12. rng. 

start 

Go Back to previous page 

GenCore version 5.1.9 
Copyright (c) 1993 - 2006 Biocceleration Ltd. 

CM nucleic - nucleic search, using sw model 

Run on: November 13, 2006, 16:38:38 ; Search time 885 Seconds 

(without alignments) 
9997.495 Million cell updates/sec 

Title: US-09-844-268-12 
Perfect score: 1269 

Sequence: 1 ctcgagccatgtgggtcccc actcagaggaaaccctgcag 1269 

Scoring table: IDENTITY_NUC 

Gapop 10.0 , Gapext 1.0 

Searched: 5244920 seqs, 3486124231 residues 

Total number of hits satisfying chosen parameters: 10489840 

Minimum DB seq length: 0 

Maximum DB seq length: 2000000000 

Post-processing: Minimum Match 0% 

Maximum Match 100% 
Listing first 45 summaries 

Database : N_Geneseq_8 : * 

1: geneseqnl980s : * 

2 : geneseqnl990s : * 

3: geneseqn2000s :* 

4: geneseqn2001as: * 

5: geneseqn2001bs:* 

6: geneseqn2002as: * 

7 : geneseqn2002bs: * 

8 : geneseqn2003as : * 

9: geneseqn2003bs:* 
10 : geneseqn2003cs : * 
11: geneseqn2003ds : * 
12 : geneseqn2004as : * 
13 : geneseqn2004bs : * 
14: geneseqn2005s : * 
15 : geneseqn2006s : * 
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Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 
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Aaxl5872 Fragment 
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Aai72831 FUTl cDNA 
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Aaql3332 GDP-Fuc:b 
Aat61677 Human alp 

Acn372 37 Himan per 
Aaq56908 DNA encod 
Aat76768 Human alp 
Aad4 6825 Human alp 
Aatl2238 Human H-t 
Aaa53820 Human H-t 
Aat63575 Chicken b 
Aat01082 2-Alpha-f 
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Aat01083 2 -Alpha- f 
Aat63576 Mouse H2K 

Adb53234 Primary r 
Aav2 9003 Porcine s 
Aav21639 Pig secre 
Abl66311 Lung cane 
Abl70026 Pancreas 
Abl64735 Lung cane 
Aav58323 Human Sec 
Adq83703 Human tum 
Adq85849 Human tum 
Adq83704 Human tum 
Adq858 50 Human tum 
Aac67966 Rat hepat 
Aad27208 Rat hepat 
Abs57887 Rat cDNA 
Aac67965 Rat hepat 
Aad27207 Rat hepat 
Abs57886 Rat cDNA 
Adb97787 Rat cDNA 
Adb97783 Rat cDNA 
Ach70096 Human gen 
Ach89801 Human gen 
Ach83796 Hioman gen 
Aax77268 X. laevis 
Abz84527 Toxicolog 
Ach76100 Human gen 
Aac03536 Human sec 



ALIGNMENTS 



RESULT 1 
AAX03811 
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ov^vyivx^ ocoiuii ivcauita iyctaiia xui r^.yyuK.aiiKJii u:7o^n^uo aiiu ooai^ii xvc:>uii ua-u^-o^^-^... rage ^ »ji 



CC 

cc 

CC 

cc 
cc 



Ol-APR-1999 (first entry) 

Swine alpha-1, 2-fucosyltransf erase 1 encoding DNA. 

Swine; pig; alpha-1, 2-fucosyltransf erase 1; FUTl; resistance; 
Escherichia coli; infection; oedema; postweaning diarrhoea; 
intestinal disorder; polymorphism; ss . 



/*tag= a 



ID AAX03811 standard; DNA; 1269 BP. 
XX 

AC AAX03811; 
XX 
DT 
XX 
DE 
XX 
KW 
KW 
KW 
XX 

OS Sus scrofa. 
XX 

™ Key Location/Qualifiers 
FT CDS 9. .1106 

FT 
XX 

PN WO9853102-A1. 
XX 

PD 26-NOV-1998. 
XX 

PF 20-MAY-1998; 98WO-US010318 . 
XX 

PR 20-MAY-1997; 97US-0047181P . 
XX 

PA (BIOT-) BIOTECHNOLOGY RES & DEV CORP. 
PA (USDA ) US SEC OF AGRIC . 

PA (SWTE-) SWISS FEDERAL INST TECHNOLOGY ZURICH 
XX 

PI Bosworth BT, Vogeli P; 
XX 

DR WPI; 1999-059746/05. 
DR P-PSDB; AAW30630. 
XX 

PT New method of identifying swine that are resistant to intestinal 

PT colonisation by Escherichia coli - comprises use of genetic polymorphic 

PT markers, used for breeding swine resistant to Escherichia coli-related 

PT diseases. 

XX 

PS Claim 6; Fig 1; 35pp; English. 
XX 
CC 



A method has been developed for the identification of swine that are 
CC resistant to intestinal colonisation by E. coli. The method comprises 
CC determining whether a genetic polymorphism associated with resistance to 
CC colonisation is present in a swine sample, and then inferring that the 
CC swine is resistant if it is homozygous for the polymorphism. The method 
CC uses the swme alpha-1, 2-fucosyltransf erase (FUTl) polymorphism. The 
CC present sequence encodes swine FUTl. The method enables the breeding of 
CC swine that are resistant to E. coli-related diseases. This method 
CC comprises breeding swine that have a genetic polymorphism in the FUTl 
CC gene. More particularly, the identification method identifies swine that 
are resistant to E. coli-related intestinal disorders if, in a sample 
taken, the only nitrogen base at residue 307 in the FUTl gene is adenine 
Larger amplified fragments from the assay can be used for RFLP analysis 
and the assay itself is used as a basis for a kit, applied to swine of 
any age, in detecting polymorphisms associated with E. coli F18 
CC receptors. The polymorphisms are useful in developing drugs to treat 
CC swine with E. coli-related diseases. However, a mutated form of the 
CC porcine FUTl gene may interfere with the normal enzyme and prevent it 
CC from producing the intestinal receptor for F18. The detection of 
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ovxWAVj^ ocon^ii ixcauita i^ciaiia lui r^yyn\^auyju u^o-t-r^uo aiiu ocaiwi ivcauii, ua-v>r-o-fi^-^... rage ui 

CC polymorphic markers in the method disclosed enables the detection and 
CC treatment of E. coli-related intestinal diseases in swine, where there 
CC has been no success using antibiotics due to unsuccessful prophylaxis 
XX 

SQ Sequence 1269 BP; 219 A; 413 C; 362 G; 275 0 U; 0 Others- 
Query Match 100.0%; Score 1269; DB 2; Length 1269; 
Best Local Similarity 100.0%; Pred. No. 3.8e-285; 

Matches 1269; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 
Qy 1 CTCGAGCCATGTGGGTCCCCAGCCGCCGCCACCTCTGTCTGACCTTCCTGCTAGTCTGTG 60 

I M I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I M I I I I I I I M I M I I I I I I I I I I I I 

Db 1 CTCGAGCCATGTGGGTCCCCAGCCGCCGCCACCTCTGTCTGACCTTCCTGCTAGTCTGTG 60 

Qy 61 TTTTAGCAGCAATTTTCTTCCTGAACGTCTATCAAGACCTCTTTTACAGTGGCTTAGACC 120 

I I I I I I I M M I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I 
Db 61 TTTTAGCAGCAATTTTCTTCCTGAACGTCTATCAAGACCTCTTTTACAGTGGCTTAGACC 120 

Qy 121 TGCTGGCCCTGTGTCCAGACCATAACGTGGTATCATCTCCCGTGGCCATATTCTGCCTGG 180 

I I I I I I I I I I I M M I M I M I I I I I I I I I I I I I I I M I I I I I I M I I I I I I I I I I I I I I 

Db 121 TGCTGGCCCTGTGTCCAGACCATAACGTGGTATCATCTCCCGTGGCCATATTCTGCCTGG 180 

Qy 181 CGGGCACGCCGGTACACCCCAACGCCTCCGATTCCTGTCCCAAGCATCCTGCCTCCTTTT 240 

I I I I I I I I I M I I I M I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I M I M I I I I I I 

Db 181 CGGGCACGCCGGTACACCCCAACGCCTCCGATTCCTGTCCCAAGCATCCTGCCTCCTTTT 240 

Qy 241 CCGGGACCTGGACTATTTACCCGGATGGCCGGTTTGGGAACCAGATGGGACAGTATGCCA 300 

I I M I I I I I I I I I I I I I I I I I I I I t I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I 
Db 241 CCGGGACCTGGACTATTTACCCGGATGGCCGGTTTGGGAACCAGATGGGACAGTATGCCA 300 

Qy 301 CGCTGCTGGCCCTGGCGCAGCTCAACGGCCGCCAGGCCTTCATCCAGCCTGCCATGCACG 360 

I I I I I I I I I I I I i I M I I I I I I I I I I I I I I I I I I I I t I I I I I I I I I I I I I I I I I I I I I I I 
Db 301 CGCTGCTGGCCCTGGCGCAGCTCAACGGCCGCCAGGCCTTCATCCAGCCTGCCATGCACG 360 

Qy 361 CCGTCCTGGCCCCCGTGTTCCGCATCACGCTGCCTGTCCTGGCGCCCGAGGTAGACAGGC 420 

I I I I t I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I M I I I I M I I M I I I I I I I I I I 
Db 361 CCGTCCTGGCCCCCGTGTTCCGCATCACGCTGCCTGTCCTGGCGCCCGAGGTAGACAGGC 420 

Qy 421 ACGCTCCTTGGCGGGAGCTGGAGCTTCACGACTGGATGTCCGAGGATTATGCCCACTTAA 480 

I t I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I i I I I I I 
Db 421 ACGCTCCTTGGCGGGAGCTGGAGCTTCACGACTGGATGTCCGAGGATTATGCCCACTTAA 480 

Qy 481 AGGAGCCCTGGCTGAAGCTCACCGGCTTCCCCTGCTCCTGGACCTTCTTCCACCACCTCC 540 

I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I M I I I M I I I I I I I I I I I j M I I I i I I I 
I^b 481 AGGAGCCCTGGCTGAAGCTCACCGGCTTCCCCTGCTCCTGGACCTTCTTCCACCACCTCC 540 

Qy 541 GGGAGCAGATCCGCAGCGAGTTCACCCTGCACGACCACCTTCGGCAAGAGGCCCAGGGGG 600 

I I i i I I i I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I M Ml I I I I I I I I I I M I I I 

Db 541 GGGAGCAGATCCGCAGCGAGTTCACCCTGCACGACCACCTTCGGCAAGAGGCCCAGGGGG 600 

Qy 601 TACTGAGTCAGTTCCGTCTACCCCGCACAGGGGACCGCCCCAGCACCTTCGTGGGGGTCC 660 

I I I I I I I I I I I M I I I I I I I I I I I M I I I I I I I I I I I I I I I i I I I I IJ I I I I I I I I I I I I 
Db 601 TACTGAGTCAGTTCCGTCTACCCCGCACAGGGGACCGCCCCAGCACCTTCGTGGGGGTCC 660 

Qy 661 ACGTGCGCCGCGGGGACTATCTGCGTGTGATGCCCAAGCGCTGGAAGGGGGTGGTGGGTG 720 

I I I I N I I i I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 661 ACGTGCGCCGCGGGGACTATCTGCGTGTGATGCCCAAGCGCTGGAAGGGGGTGGTGGGTG 720 

Qy 721 ACGGCCGTTACCTCCAGCAGGCTATGGACTGGTTCCGGGCCCGATACGAAGCCCCCGTCT 780 

I I I I I I i I I I I N I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 721 ACGGCCGTTACCTCCAGCAGGCTATGGACTGGTTCCGGGCCCGATACGAAGCCCCCGTCT 780 
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ov^v/xvu, ocai\^u ivc&uiia J^ctoiia lsji r^ypn^aiiKJH u^o-t-r^uo oiiu ocan^u ivcauit ua-v:7-o-r-r-z,.. . r a^c J 



TTGTGGTCACCAGCAACGGCATGGAGTGGTGCCGGAAGAACATCGACACCTCCCGGGGGG 840 
I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I 
TTGTGGTCACCAGCAACGGCATGGAGTGGTGCCGGAAGAACATCGACACCTCCCGGGGGG 840 

ACGTGATCTTTGCTGGCGATGGGCGGGAGGCCGCGCCCGCCAGGGACTTTGCGCTGCTGG 900 
I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I i I I I I I I I I I I I I I I i I I I I I I I I I I I I I 
ACGTGATCTTTGCTGGCGATGGGCGGGAGGCCGCGCCCGCCAGGGACTTTGCGCTGCTGG 900 

TGCAGTGCAACCACACCATCATGACCATTGGCACCTTCGGCTTCTGGGCCGCCTACCTGG 960 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I M I 
TGCAGTGCAACCACACCATCATGACCATTGGCACCTTCGGCTTCTGGGCCGCCTACCTGG 960 

CTGGTGGAGATACCATCTACTTGGCTAACTTCACCCTGCCCACTTCCAGCTTCCTGAAGA 1020 
I I t I I I I I I I I I I I I I i I I I I I I I I I I I I I I M I I I I I I M I I j I I I I I I I I I I I I I I I I 
CTGGTGGAGATACCATCTACTTGGCTAACTTCACCCTGCCCACTTCCAGCTTCCTGAAGA 1020 

TCTTTAAACCCGAGGCTGCCTTCCTGCCCGAGTGGGTGGGCATTAATGCAGACTTGTCTC 1080 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I M I I I I I I I M I I 
TCTTTAAACCCGAGGCTGCCTTCCTGCCCGAGTGGGTGGGCATTAATGCAGACTTGTCTC 1080 

CACTCCAGATGTTGGCTGGGCCTTGAACCAGCCAGGAGCCTTTCTGGAATAGCCTCGGTC 1140 

I M I M I I i I I M I M I I I I I I I I I I I I I I I I I I j I I I M M I I M I I I t I I I I I I I M I 

CACTCCAGATGTTGGCTGGGCCTTGAACCAGCCAGGAGCCTTTCTGGAATAGCCTCGGTC 1140 

AACCCAGGGCCAGCGTTATGGGTCTCCGGAAGCCCGAGTAACTTCCGGAGATGCTGGTGG 1200 
I i I I M t I I I I I I I I I I I I I I I I I I I I I I I [ I I I I I I I I I I I I I M I I I I I I I I I I I I I I 
AACCGAGGGCCAGCGTTATGGGTCTCCGGAAGCCCGAGTAACTTCCGGAGATGCTGGTGG 1200 

TCCTGTAGCAGGCTGGACACTTATTTCAAGAGTGATTCTAATTGGCTGGACTCAGAGGAA 1260 
I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
TCCTGTAGCAGGCTGGACACTTATTTCAAGAGTGATTCTAATTGGCTGGACTCAGAGGAA 1260 



I I I f I I I I I 



RESULT 2 
AAX15872 

ID AAX15872 standard; DNA; 1269 BP. 
XX 

AC AAX15872; 
XX 

DT 13-MAY-1999 (first entry) 
XX 
DE 
XX 

KW Swine alpha (1,2) fucosyl transferase; FUTl; Escherichia coli; resistant; 

KW E. coli-associated intestinal disorder; E coli infection; ss . 

XX 

OS Synthetic. 
OS Sus sp. 
XX 

FH Key Location/Qualifiers 
FT CDS 9. .1106 

FT /*tag= a 

XX 

PN WO9853101-A2. 
XX 

PD 26-NOV-1998. 



Ov 


781 


Db 


781 


Ov 


841 


Db 


841 


Ov 


901 


Db 


901 


Ov 


961 


Db 


^ O J. 


Ov 


1021 


Db 


J. U ^ X 


Ov 
vy 


1081 


Db 


X U O X 


Ov 


X X *i X 


uu 




vy 


X ^ U X 


Db 


1201 


Qy 


1261 


Db 


1261 



Fragment of pHT83xF encoding swine alpha (1,2) fucosyltransferase , 
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ov^wivl;, ocoh^u xvcauita xyctoiia lui ^yyiiK^aiiun v^o-t-r-^wo oiiu ocan^ii ±vc&uil ua-v^-o-t^-^... ra5c u ui -rw 
XX 

PF 20-MAY-1998; 98WO-US010259 . 
XX 

PR 20-MAY-1997; 97US-004 7181P . 
XX 

PA (BIOT-) BIOTECHNOLOGY RES & DEV CORP. 
XX 

PI Bosworth BT; 
XX 

DR WPI; 1999-131692/11. 

DR P-PSDB; AAW97356. 
XX 

PT identifying swine genetically resistant to E. coli associated diseases - 

PT using PCR-RFLP to assay for polymorphisms in the alpha (1,2) 

PT fucosyltransf erase 1 gene. 
XX 

PS Claim 6; Fig 1; 19pp; English. 
XX 

CC The present sequence encodes swine alpha(l,2) fucosyltransf erase (FUTl). 

CC The specification describes methods relating to Escherichia coli- 

CC resistant swine. One of the methods for identifying a swine resistant to 

CC E. coli-associated intestinal disorders, comprises determining whether 

CC the base at 307 of alpha (1,2) fucosyltransf erasel gene (FUTl) is adenine 

CC (sic), in which case the swine are resistant. The porcine FUTl 

CC polymorphisms can be used to develop drugs for the treatment of swine 

CC having E. coli-associated disease. The methods can also be used in 

CC breeding programmes to identify swine with resistance to E coli infection 

XX 

SQ sequence 1269 BP; 219 A; 413 C; 362 G; 275 T; 0 U; 0 Other; 

Query Match 100.0%; Score 1269; DB 2; Length 1269; 

Best Local Similarity 100.0%; Pred. No. 3.8e-285; 

Matches 1269; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 

Qy 1 CTCGAGCCATGTGGGTCCCCAGCCGCCGCCACCTCTGTCTGACCTTCCTGCTAGTCTGTG 60 

I N I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I M I I I I I I t I I I I I I i I I I I I I I I 
1 CTCGAGCCATGTGGGTCCCCAGCCGCCGCCACCTCTGTCTGACCTTCCTGCTAGTCTGTG 60 

Qy 61 TTTTAGCAGCAATTTTCTTCCTGAACGTCTATCAAGACCTCTTTTACAGTGGCTTAGACC 120 

I I I I I I I i I I I I I I M I I I I I I I I I I I M I I I I I t I I I I I I I I I I I I I I I I I I I I I I I i I 
61 TTTTAGCAGCAATTTTCTTCCTGAACGTCTATCAAGACCTCTTTTACAGTGGCTTAGACC 120 

Qy 121 TGCTGGCCCTGTGTCCAGACCATAACGTGGTATCATCTCCCGTGGCCATATTCTGCCTGG 180 

■II I Illllll I I I I I I I I I I I I I I I I I I I I i I I I I I 

121 TGCTGGCCCTGTGTCCAGACCATAACGTGGTATCATCTCCCGTGGCCATATTCTGCCTGG 180 

Qy 181 CGGGCACGCCGGTACACCCCAACGCCTCCGATTCCTGTCCCAAGCATCCTGCCTCCTTTT 240 

I N I I I I I I I I I I I I I I I I I M I I I I M I I I I I I I I M I I I I I I I I I I I M I I I M I I I I 

181 CGGGCACGCCGGTACACCCCAACGCCTCCGATTCCTGTCCCAAGCATCCTGCCTCCTTTT 240 
Qy 241 CCGGGACCTGGACTATTTACCCGGATGGCCGGTTTGGGAACCAGATGGGACAGTATGCCA 300 

IIIMM NMIIIIIIIMIIIIIllMlllllllllltllllllllllllll 

Db 241 CCGGGACCTGGACTATTTACCCGGATGGCCGGTTTGGGAACCAGATGGGACAGTATGCCA 300 

Qy 301 CGCTGCTGGCCCTGGCGCAGCTCAACGGCCGCCAGGCCTTCATCCAGCCTGCCATGCACG 360 

IIIIIIINilllllllllllllllllllllliliiiiiiiiiiiMIIIIIMMIlll 

Db 301 CGCTGCTGGCCCTGGCGCAGCTCAACGGCCGCCAGGCCTTCATCCAGCCTGCCATGCACG 360 

Qy 361 CCGTCCTGGCCCCCGTGTTCCGCATCACGCTGCCTGTCCTGGCGCCCGAGGTAGACAGGC 420 

NIMIIIIIIIIIMIIMIIIIIIIIIIIIIIMIIIIIIIlllliMiiiiiiiiii 

Db 361 CCGTCCTGGCCCCCGTGTTCCGCATCACGCTGCCTGTCCTGGCGCCCGAGGTAGACAGGC 420 
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ov.wxvi:, ocoioii xvcauita x^ctaii:* lui r^yyu^aiiyjii v/3?u-tnz,uo aau ocan^u Avcauit ua-v:7-o^n-z.,. 



ACGCTCCTTGGCGGGAGCTGGAGCTTCACGACTGGATGTCCGAGGATTATGCCCACTTAA 

I IIIIMMMIIIIIMIIMIIMliiiiiiiiiiiiiiiM 

ACGCTCCTTGGCGGGAGCTGGAGCTTCACGACTGGATGTCCGAGGATTATGCCCACTTAA 



Qy 


421 


Db 


421 


Qy 


481 


Db 


481 


Qy 


541 


Db 


541 


Qy 


601 


Db 


601 


Qy 


661 


Db 


661 


Qy 


721 


Db 


721 


Qy 


781 


Db 


781 


Qy 


841 


Db 


841 


Qy 


901 


Db 


901 


Qy 


961 


Db 


961 


Qy 


1021 


Db 


1021 


Qy 


1081 


Db 


1081 


Qy 


1141 . 


Db 


1141 . 


Qy 


1201 ' 


Db 


1201 ' 


Qy 


1261 1 


Db 


1261 i 



480 
480 



AGGAGCCCTGGCTGAAGCTCACCGGCTTCCCCTGCTCCTGGACCTTCTTCCACCACCTCC 540 

IIIIIMIIIIIIIIMIIllMIMIIIIIIIIIMIMIIIIIIIIIIIIIII 

AGGAGCCCTGGCTGAAGCTCACCGGCTTCCCCTGCTCCTGGACCTTCTTCCACCACCTCC 540 
GGGAGCAGATCCGCAGCGAGTTCACCCTGCACGACCACCTTCGGCAAGAGGCCCAGGGGG 600 

lllllll IMtlllllllMlllllllliiiiiiiiiiiiiiiiiiiiiiii 

GGGAGCAGATCCGCAGCGAGTTCACCCTGCACGACCACCTTCGGCAAGAGGCCCAGGGGG 600 

TACTGAGTCAGTTCCGTCTACCCCGCACAGGGGACCGCCCCAGCACCTTCGTGGGGGTCC 660 
■ I I I I I I I I I i I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I ,,,,, I , 

TACTGAGTCAGTTCCGTCTACCCCGCACAGGGGACCGCCCCAGCACCTTCGTGGGGGTCC 660 
ACGTGCGCCGCGGGGACTATCTGCGTGTGATGCCCAAGCGCTGGAAGGGGGTGGTGGGTG 720 

Mill IIMIMIIIIIMIIIIMIIIIIIIIIIIMIlliiMillll 

ACGTGCGCCGCGGGGACTATCTGCGTGTGATGCCCAAGCGCTGGAAGGGGGTGGTGGGTG 720 
ACGGCCGTTACCTCCAGCAGGCTATGGACTGGTTCCGGGCCCGATACGAAGCCCCCGTCT 780 

I I I I I I I I I I I I I I I I I I t M I I I I I I t I I I I I M I I I I I M I I I I I I I M I I I I I I I I I 

ACGGCCGTTACCTCCAGCAGGCTATGGACTGGTTCCGGGCCCGATACGAAGCCCCCGTCT 780 

TTGTGGTCACCAGCAACGGCATGGAGTGGTGCCGGAAGAACATCGACACCTCCCGGGGGG 840 
NlilllMMIIIIII M I M I M I M I I I I I I I I M I I I I I I I I I I I I I I I I 

TTGTGGTCACCAGCAACGGCATGGAGTGGTGCCGGAAGAACATCGACACCTCCCGGGGGG 840 

ACGTGATCTTTGCTGGCGATGGGCGGGAGGCCGCGCCCGCCAGGGACTTTGCGCTGCTGG 900 
> I I I t I M I I I I I I I I I I I I i M I I I I I I I I I I I I I M I t t I I I I 1 I I I I I i I i I I i 1 I I 
ACGTGATCTTTGCTGGCGATGGGCGGGAGGCCGCGCCCGCCAGGGACTTTGCGCTGCTGG 900 

TGCAGTGCAACCACACCATCATGACCATTGGCACCTTCGGCTTCTGGGCCGCCTACCTGG 960 

KIMMMIIMIill lllllllllllllllltllllllllllllMIIII 

TGCAGTGCAACCACACCATCATGACCATTGGCACCTTCGGCTTCTGGGCCGCCTACCTGG 960 
CTGGTGGAGATACCATCTACTTGGCTAACTTCACCCTGCCCACTTCCAGCTTCCTGAAGA 1020 

IIIIIIMMIIIMIIIIIIII M I I I I I I I I I I I I I I I M I I I I I 11 I I I I I I 

CTGGTGGAGATACCATCTACTTGGCTAACTTCACCCTGCCCACTTCCAGCTTCCTGAAGA 1020 
TCTTTAAACCCGAGGCTGCCTTCCTGCCCGAGTGGGTGGGCATTAATGCAGACTTGTCTC 1080 

KIIIMIIIM I iillMllllllllllllllllilllllllMlll 

TCTTTAAACCCGAGGCTGCCTTCCTGCCCGAGTGGGTGGGCATTAATGCAGACTTGTCTC 1080 

CACTCCAGATGTTGGCTGGGCCTTGAACCAGCCAGGAGCCTTTCTGGAATAGCCTCGGTC 1140 

" Illlllllllll I I I I I I I I I I I M I I I I IIIIIIIIM 

CACTCCAGATGTTGGCTGGGCCTTGAACCAGCCAGGAGCCTTTCTGGAATAGCCTCGGTC 



AACCCAGGGCCAGCGTTATGGGTCTCCGGAAGCCCGAGTAACTTCCGGAGATGCTGGTGG 
>■ I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I , I 11 I 

AACCCAGGGCCAGCGTTATGGGTCTCCGGAAGCCCGAGTAACTTCCGGAGATGCTGGTGG 



1140 
1200 
1200 



TCCTGTAGCAGGCTGGACACTTATTTCAAGAGTGATTCTAATTGGCTGGACTCAGAGGAA 1260 
■IIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIilliiiiiiiii 

TCCTGTAGCAGGCTGGACACTTATTTCAAGAGTGATTCTAATTGGCTGGACTCAGAGGAA 1260 



ACCCTGCAG 1269 
I I I I I I I I I 
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lJ^^vyAVl_> ocaiv^ii xvc^uiid i^ctcuia lui r^yyu^auKJii u:7onn-^.uo miu ocaii^ii 



RESULT 3 
AAZ94417 

ID A7V294417 standard; DNA; 1269 BP. 
XX 

AC * AAZ94417; 
XX 

DT 18-JUL-2000 (first entry) 
XX 

DE Pig alpha-1-2 fucosyltransf erase FUTl gene. 
XX 

KW Alpha-1-2 fucosyltransf erase; FUTl; pig; polymorphism; Escherichia coli; 

KW resistance; ss. 

XX 

OS Sus scrofa. 
XX 

FH Key Location/Qualifiers 

FT CDS 9. .1106 

FT /*tag= a 

FT /transl_except= {pos:726. .728, aa:Ala) 

FT variation replace ( 315, A) 

FT /*tag= b 

/note= "adenine is substituted for guanine in resistant 
FT pigs" 
XX 

PN WO200016641-A1. 
XX 

PD 30-MAR-2000. 
XX 

PF 17-SEP-1999; 99WO-US021408 . 
XX 

PR 18-SEP-1998; 98US-00151592 . 
XX 

PA (BIOT-) BIOTECHNOLOGY RES & DEV CORP. 

PA (USDA ) US SEC OF AGRIC . 

XX 

PI Bosworth BT, Ridpath J, Wiseman B; 
XX 

DR WPI; 2000-283404/24. 

DR P-PSDB; AAY79302. 
XX 

PT Improving weight gain in swine using swine genetically resistant 

PT Escherichia coli and feeding swine high levels of plant based protein 

XX 

PS Disclosure; Fig 1; 33pp; English. 
XX 

CC This is the nucleotide sequence of the pig FUTl gene encoding alpha-1-2 

CC fucosyltransf erase (see AAY7 9302) . A polymorphism at position 307 of the 

CC coding region is associated with susceptibility to F18 Escherichia coli 

CC colonization; pigs homozygous for adenine at position 307 are resistant 

CC to colonization, while heterozygous animals and animals homozygous for 

CC guanine at position 307 are susceptible to colonization. A claimed method 

CC for improving weight gain in pigs involves selecting animals that are 

CC genetically resistant to *E. coli colonization and feeding these animals 

CC high levels of plant-based protein concentrate. A claimed method for 

CC preventing F18 E. coli colonization in swine, especially swine that are 

CC genetically susceptible to F18 E. coli colonization, involves replacing 

CC some or all of the plant-based proteins in the diet with animal-based 

CC proteins. The polymorphism in the FUTl gene is also useful for developing 

CC drugs to treat swine that have E. coli associated disease. The 

CC polymorphism can be detected using PCR-RFLP tests (see also AAZ94418-19) 
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ij^KjL\i^ ocaiL.li xvcauita i^cuaiia lui i-Lppin^auua u:7onn^uo oiiu ocaiL.ii xvcauit ua-v^'-o-r^-z.... x a^c 
XX 

SQ Sequence 1269 BP; 219 A; 413 C; 362 G; 275 T; 0 U; 0 Other; 

Query Match 100.0%; Score 1269; DB 3; Length 1269; 

Best Local Similarity 100.0%; Pred. No. 3.8e-285; 

Matches 1269; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 



Qy 


1 


CTCGAGCCATGTGGGTCCCCAGCCGCrGrrArrTrTnTrTr;aprT'nr'rT^rr''r7\r'rpr'rTinrTir' 

I N 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 

CTCGAGCCATGTGGGTCCCCAGCCGCCGCCACCTCTGTCTGACCTTCCTGCTAGTCTGTG 


60 


Db 


1 


60 


Qy 


61 


T T T TAGCAGCAATT T TC TTCr TG A AP r;T r TAT r a zv r: nrr T p T T rpqi 7\ r A r- rnnn m oiTx 7\ 

1 1 1 1 N 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 t 1 1 1 1 1 

TTTTAGCAGCAATTTTCTTCCTGAACGTCTATCAAGACCTCTTTTACAGTGGCTTAGACC 


120 


Db 


61 


120 


Qy 


121 


TGC TGGC CC TGTGTC C AG AC r A T A AP r;T rif^T A TP A T r TP r r r Tr* r n r- A 7\ Ti m m m/-- 

1 1 1 N 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 I 1 1 1 I 1 1 1 1 1 1 1 1 1 1 1 
TGCTGGCCCTGTGTCCAGACCATAACGTGGTATCATCTCCCGTGGCCATATTCTGCCTGG 


180 


Db 


121 


180 


Qv 


181 


CGGGCAPGPPf^nT AP APPPPZiaPf^PPTrTT" ATiTpr'TT'frr'r'r' a T\r*m\ m/-*/^m/-'/-«/-»m/-.r^ mmrnm 
v-.vjvj\jv^rtVrf\jv.,v^vjo x^cia/i-X X L-v^ X trXL-UCAAGCATCCTGCCTCC TTTT 

1 1 1 1 1 1 1 1 1 1 i 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 
CGGGCACGCCGGTACACCCCAACGCCTCCGATTCCTGTCCCAAGCATCCTGCCTCCTTTT 


240 


Db 


181 


240 


Ov 


241 


^ o vjrt\^ 1 ijvjau X a X X X AC V- u trtrA X (jQr c u (jg t t t g ggaac c agat gggac agt at gc c a 
1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

CCGGGACCTGGACTATTTACCCGGATGGCCGGTTTGGGAACCAGATGGGACAGTATGCCA 


300 


Db 


241 


300 


Qv 
Db 


301 
301 


1 1 1 1 1 i 1 1 1 1 1 1 i 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 i 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

CGCTGCTGGCCCTGGCGCAGCTCAACGGCCGCCAGGCCTTCATCCAGCCTGCCATGCACG 


360 
360 


Qy 


361 


CCGTCC TGGCP PP POTH-TTPPrtP A TP nr* rr" Tr'r'r' mr* T^r" n m/^r* r'r^ r*/^/^ TV ^ m ^ /-^ TV ^ ^ « 
>-v-vj J. v^v.- J. vjvjrv-.v^v^(^i,^vj ic*^^jU/\xuAotrL. XtjL.L. XtrXCCTGGCGCCCGAGGTAGACAGGC 

1 1 1 1 N 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 II , 

CCGTCCTGGCCCCCGTGTTCCGCATCACGCTGCCTGTCCTGGCGCCCGAGGTAGACAGGC 


420 


Db 


361 


420 


Qy 


421 


ACGCTCCTTGGCGGGAF^Tn^nAr^^PTTP APfznPTrr'nTir'nnr'r'r* An/" Amm7\mr^/^r«/^7\i^mm-n 

illllllllll t Illllllllllltlllllllll 

ACGCTCCTTGGCGGGAGCTGGAGCTTCACGACTGGATGTCCGAGGATTATGCCCACTTAA 


480 


Db 


421 


480 


Qy 


481 


AGGAGC C C TGGC TGAAGC T CAP C T T P P P P T r:;p T r r TP r a r r T fp r* fp fpr* n A r» A m o /" 

■Nil 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 

AGGAGCCCTGGCTGAAGCTCACCGGCTTCCCCTGCTCCTGGACCTTCTTCCACCACCTCC 


540 


Db 


481 


540 


Qy 


541 


GGGAGCAGATCCGCAGCGAGTTCACCCT GP AP GAPP AP p T TP Gftir a nr nrpr* r-r* A n r-r^ /"/^ 

1 1 1 1 M i 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 t 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 t 1 1 1 1 1 1 1 t 1 

GGGAGCAGATCCGCAGCGAGTTCACCCTGCACGACCACCTTCGGCAAGAGGCCCAGGGGG 


600 


Db 


541 


600 


Qy 


601 


TAG TGAGTCAGTTCCGTCTACCCCGCAC AGGGG APP GPPPP AGP APPTTp r Tr rrr-r r rrnr* 

IN illillMIMIMIIIIIIIMIIIIMIIIIMlMliM II 

TACTGAGTCAGTTCCGTCTACCCCGCACAGGGGACCGCCCCAGCACCTTCGTGGGGGTCC 


560 


Db 


601 


660 


Qy 

Db 


661 
661 


ACGTGCGCCGCGGGGACTATCTGCGTGTGATGCCCAAGCGCTGGAAGGGGGTGGTGGGTG 

HIM 1 Mill III IIIMIIII III 11 II Mil II 

ACGTGCGCCGCGGGGACTATCTGCGTGTGATGCCCAAGCGCTGGAAGGGGGTGGTGGGTG 


720 
720 


Qy 


721 


ACGGCCGTTACCTCCAGCAGGCTATGGACTGGTTCCGGGCCCGATACGAAGCCCCCGTCT 

IMMMI illlMMIIIIMIIIIIIIIIIIIIIIIIIIIIIIIIIII 

ACGGCCGTTACCTCCAGCAGGCTATGGACTGGTTCCGGGCCCGATACGAAGCCCCCGTCT 


780 


Db 


721 


780 


Qy 


781 


TTGTGGTCACCAGCAACGGCATGGAGTGGTGCCGGAAGAACATCGACACCTCCCGGGGGG 

MMIMMMIIIIIIIMIIIIllliiinii Ill MM MM MM INI 


840 
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781 TTGTGGTCACCAGCAACGGCATGGAGTGGTGCCGGAAGAACATCGACACCTCCCGGGGGG 840 

ACGTGATCTTTGCTGGCGATGGGCGGGAGGCCGCGCCCGCCAGGGACTTTGCGCTGCTGG 900 

i I I I I I I M I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I f 

ACGTGATCTTTGCTGGCGATGGGCGGGAGGCCGCGCCCGCCAGGGACTTTGCGCTGCTGG 900 

TGCAGTGCAACCACACCATCATGACCATTGGCACCTTCGGCTTCTGGGCCGCCTACCTGG 960 
I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I M I I I I I I I I I I I I I 
TGCAGTGCAACCACACCATCATGACCATTGGCACCTTCGGCTTCTGGGCCGCCTACCTGG 960 

CTGGTGGAGATACCATCTACTTGGCTAACTTCACCCTGCCCACTTCCAGCTTCCTGAAGA 1020 

I I I I I M I I I I I I I I I I I I I I I I I I I I t M I I I I I M I I I I I i I I I I I I I I I I M I I i I I 

CTGGTGGAGATACCATCTACTTGGCTAACTTCACCCTGCCCACTTCCAGCTTCCTGAAGA 1020 
TCTTTAAACCCGAGGCTGCCTTCCTGCCCGAGTGGGTGGGCATTAATGCAGACTTGTCTC 1080 

N M I I I I I I I I I I M I I I I I I I I I I I I I I M M I M I I I I I I M I I I I I I I I I I I I I I I 

TCTTTAAACCCGAGGCTGCCTTCCTGCCCGAGTGGGTGGGCATTAATGCAGACTTGTCTC 1080 
CACTCCAGATGTTGGCTGGGCCTTGAACCAGCCAGGAGCCTTTCTGGAATAGCCTCGGTC 1140 

I I M I I I I M I I I I I I I I I I M I I I I I I I t M I I I I M I I I I M I I I I I I M I I I I t I I i 

CACTCCAGATGTTGGCTGGGCCTTGAACCAGCCAGGAGCCTTTCTGGAATAGCCTCGGTC 1140 

AACCCAGGGCCAGCGTTATGGGTCTCCGGAAGCCCGAGTAACTTCCGGAGATGCTGGTGG 1200 
I I I N I I I i I I I I I M I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I 
AACCCAGGGCCAGCGTTATGGGTCTCCGGAAGCCCGAGTAACTTCCGGAGATGCTGGTGG 1200 

TCCTGTAGCAGGCTGGACACTTATTTCAAGAGTGATTCTAATTGGCTGGACTCAGAGGAA 1260 

I I I I I I M I I I I I I I I t I I I I I I I I I I M I I I I I M I I I I I I I I I I I I I I I I I M I I I I I 

TCCTGTAGCAGGCTGGACACTTATTTCAAGAGTGATTCTAATTGGCTGGACTCAGAGGAA 1260 

ACCCTGCAG 12 69 
I I I I I I I I I 
ACCCTGCAG 12 69 

RESULT 4 
AAI72831 

ID AAI72831 standard; cDNA; 1269 BP. 
XX 

AC AAI72831; 
XX 

DT 22-JUL-2002 (first entry) 
XX 

DE FUTl cDNA. 
XX 

KW Gene; pig; swine; alpha (1,2) fucosyl transferase; FUTl; weight gain; F18; 

KW Escherichia coli; ss . 

XX 

OS Sus scrofa. 
XX 

Key Location/Qualifiers 
FT CDS 9. .1106 

FT /*tag= a 

FT /product= "FUTl" 

FT mutation 315 
FT /*tag= b 

/phenotype= "Confers resistance to F18 E. coli" 

XX 

PN US6355859-B1. 
XX 

PD 12-MAR-2002. 



Db 


781 


Qy 


841 


Db 


841 


Qy 


901 


Db 


901 


Qy 


961 


Db 


961 


Qy 


1021 


Db 


1021 


Qy 


1081 


Db 


1081 


Qy 


1141 


Db 


1141 


Qy 


1201 


Db 


1201 


Qy 


1261 


Db 


1261 
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ov^vyivij, ocaii.li ivcauita x^ctoiia lui rYppiiuatiuii u^onnz,uo aim ocan^ii xvcauit u»-u^-o^n... ra^c i i ui 



XX 

PF 18-SEP-1998; 98US-00151592 . 
XX 

PR 20-MAY-1997; 97US-004 7181P . 
XX 

PA (BIOT-) BIOTECHNOLOGY RES & DEV CORP. 

PA (USDA ) US SEC OF AGRIC . 

XX 

PI Bosworth B, Ridpath J, Wiseman B; 
XX 

DR WPI; 2002-391652/42. 
DR P-PSDB; AAB47995. 
XX 
PT 
PT 



Feeding swine genetically susceptible to F18 Escherichia coli 
colonization with a low plant protein based diet increases weight gain 
PT and lowers intestinal disease associated with E. coli infection 
XX 

PS Claim 1; Col 13-18; 9pp; English. 
XX 

CC This sequence represents the swine alpha (1,2) fucosyltransf erase (FUTl) 
CC gene. A FUTl gene in which there is a base other than adenine at position 
CC 307, may be used for improving weight gain in swine that are genetically 
susceptible to F18 Escherichia coli. The weight gain may be activated by 
feeding a diet of at least 40% animal based proteins. The feeding method 
CC is used to control F18 E. coli associated intestinal disease in swine 
XX 

SQ Sequence 1269 BP; 219 A; 414 C; 361 G; 275 T; 0 U; 0 Other; 

Query Match 99.6%; Score 1264.2; DB 6; Length 1269; 

Best Local Similarity 99.8%; Pred. No. 5e-284; 

Matches 1266; Conservative 0; Mismatches 3; Indels 0; Gaps 



CC 
CC 



Qy 1 CTCGAGCCATGTGGGTCCCCAGCCGCCGCCACCTCTGTCTGACCTTCCTGCTAGTCTGTG 

' I > I I I I I I I t I I I I I I I M I I I I I M M I I I I I I I I I I M I I M M I I I I I I I I I I M 

CTCGAGCCATGTGGGTCCCCAGCCGCCGCCACCTCTGTCTGACCTTCCTGCTAGTCTGTG 60 



0; 

60 



Db 1 



61 TTTTAGCAGCAATTTTCTTCCTGAACGTCTATCAAGACCTCTTTTACAGTGGCTTAGACC 120 
I > I I I I I I t I I i I I I I I I I I I I I I I I i I I I I I I I I I I I I I i I I I I I I I I I I I I I I I I I I I 

TTTTAGCAGCAATTTTCTTCCTGAACGTCTATCAAGACCTCTTTTACAGTGGCTTAGACC 120 



Db 61 



Qy 121 TGCTGGCCCTGTGTCCAGACCATAACGTGGTATCATCTCCCGTGGCCATATTCTGCCTGG 180 

■■IMIIMIIItlllllllllllllMIIIII IIIIIIMII Illllll 

TGCTGGCCCTGTGTCCAGACCATAACGTGGTATCATCTCCCGTGGCCATATTCTGCCTGG 180 



Db 121 



Qy 181 CGGGCACGCCGGTACACCCCAACGCCTCCGATTCCTGTCCCAAGCATCCTGCCTCCTTTT 240 

' N I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I j I I I I I II I I ,, I ,, I I I , 

CGGGCACGCCGGTACACCCCAACGCCTCCGATTCCTGTCCCAAGCATCCTGCCTCCTTTT 240 



Db 181 



Qy 241 CCGGGACCTGGACTATTTACCCGGATGGCCGGTTTGGGAACCAGATGGGACAGTATGCCA 300 

' I i I I I I M I I I I I I I I I I I I I I i I I I I I I I I I I I I I I I M I I I 

CCGGGACCTGGACTATTTACCCGGATGGCCGGTTTGGGAACCAGATGGGACAGTATGCCA 300 



Db 241 



Qy 301 CGCTGCTGGCCCTGGCGCAGCTCAACGGCCGCCAGGCCTTCATCCAGCCTGCCATGCACG 360 

I I I I I i M I I I I I I I I I I I I I i I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I 
CGCTGCTGGCCCTGGCGCAGCTCAACGGCCGCCAGGCCTTCATCCAGCCTGCCATGCACG 360 



Db 301 



Qy 361 CCGTCCTGGCCCCCGTGTTCCGCATCACGCTGCCTGTCCTGGCGCCCGAGGTAGACAGGC 420 

I I M I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I 

Db 361 CCGTCCTGGCCCCCGTGTTCCGCATCACGCTGCCTGTCCTGGCGCCCGAGGTAGACAGGC 420 

Qy 421 ACGCTCCTTGGCGGGAGCTGGAGCTTCACGACTGGATGTCCGAGGATTATGCCCACTTAA 480 
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ov^v/xv^ ocaiv.li ivcauita i^ctaiia iwi /T.^piiv.auuii xj^o-t-t^yjo oiiu ocaiv.ii xvcauit ua-w^-o-t^... x a^c i^ ux 

IIIMIMIIIIIIIMIIIIIIMIIIIIIIIIIIIIlllliMIIMIIIIMItlll 

Db 421 ACGCTCCTTGGCGGGAGCTGGAGCTTCACGACTGGATGTCCGAGGATTATGCCCACTTAA 480 

Qy 481 AGGAGCCCTGGCTGAAGCTCACCGGCTTCCCCTGCTCCTGGACCTTCTTCCACCACCTCC 540 

IINIIIIMIIMIIIIIIIIII IIIIIIMIIIIIMIIIIIIMIIIIII 

Db 481 AGGAGCCCTGGCTGAAGCTCACCGGCTTCCCCTGCTCCTGGACCTTCTTCCACCACCTCC 540 

Qy 541 GGGAGCAGATCCGCAGCGAGTTCACCCTGCACGACCACCTTCGGCAAGAGGCCCAGGGGG 600 

I N M I I I M I I I I I I I I I I I I I I I I I I I I I I IMIIIIMIIIIIIIIIIIII 

Db 541 GGGAGCAGATCCGCAGCGAGTTCACCCTGCACGACCACCTTCGGCAAGAGGCCCAGGGGG 600 

Qy 601 TACTGAGTCAGTTCCGTCTACCCCGCACAGGGGACCGCCCCAGCACCTTCGTGGGGGTCC 660 

*l I I I I I I I I I M I I I I I I I I I I I I I I I I I M I I I I I I t I I I I I I I I I I I I I I I I M M I I 

Db 601 TACTGAGTCAGTTCCGTCTACCCCGCACAGGGGACCGCCCCAGCACCTTCGTGGGGGTCC 660 

Qy 661 ACGTGCGCCGCGGGGACTATCTGCGTGTGATGCCCAAGCGCTGGAAGGGGGTGGTGGGTG 720 

I I N I I I I I I I I I I I I M I I I M I I I M M I I I I I I I I I I I I I I I I I I I M I I I I I I I I I 

Db 661 ACGTGCGCCGCGGGGACTATCTGCGTGTGATGCCCAAGCGCTGGAAGGGGGTGGTGGGTG 720 

Qy 721 ACGGCCGTTACCTCCAGCAGGCTATGGACTGGTTCCGGGCCCGATACGAAGCCCCCGTCT 780 

II MIIMIIIIIIMIIIIIMIIIIIIIlllliiiiiiiiiiiiiii 

Db 721 ACGGCGCTTACCTCCAGCAGGCTATGGACTGGTTCCGGGCCCGATACGAAGCCCCCGTCT 780 

Qy 781 TTGTGGTCACCAGCAACGGCATGGAGTGGTGCCGGAAGAACATCGACACCTCCCGGGGGG 840 

I I N I M I i I M It I I I II II I M I I I I I I [ I I I I t I I I I I I I I I I I I I I I I I I I 

781 TTGTGGTCACCAGCAACGGCATGGAGTGGTGCCGGAAGAACATCGACACCTCCCGGGGGG 840 

Qy 841 ACGTGATCTTTGCTGGCGATGGGCGGGAGGCCGCGCCCGCCAGGGACTTTGCGCTGCTGG 900 

I I I I I I I I I I M I f I I I I I I I M I I I I I I I I I I M t I I I I I I I I I I I I I I I I I I I I M I I 
Db 841 ACGTGATCTTTGCTGGCGATGGGCGGGAGGCCGCGCCCGCCAGGGACTTTGCGCTGCTGG 900 

Qy 901 TGCAGTGCAACCACACCATCATGACCATTGGCACCTTCGGCTTCTGGGCCGCCTACCTGG 960 

N I M I I I I i t I I I I I I I I I I I I I I I I I I M I I M I I I I I I I I I I I I I I I I I M I I M I I 

Db 901 TGCAGTGCAACCACACCATCATGACCATTGGCACCTTCGGCTTCTGGGCCGCCTACCTGG 960 

Qy 961 CTGGTGGAGATACCATCTACTTGGCTAACTTCACCCTGCCCACTTCCAGCTTCCTGAAGA 1020 

I I I I I I i I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I [ t I I I I I I I I 
Db 961 CTGGTGGAGATACCATCTACTTGGCTAACTTCACCCTGCCCACTTCCAGCTTCCTGAAGA 1020 

Qy 1021 TCTTTAAACCCGAGGCTGCCTTCCTGCCCGAGTGGGTGGGCATTAATGCAGACTTGTCTC 1080 

I i i i M I I I I I M I I I M I i I I I I I t I I I I I I I I I I t I I I I I I I I I I I I I I I I I I I I I I I 
Db 1021 TCTTTAAACCCGAGGCTGCCTTCCTGCCCGAGTGGGTGGGCATTAATGCAGACTTGTCTC 1080 

Qy 1081 CACTCCAGATGTTGGCTGGGCCTTGAACCAGCCAGGAGCCTTTCTGGAATAGCCTCGGTC 1140 

I I I I I I I I I N I I I I I I I I I I I M II I I I I t I I I I I i I I I I I I I I I I I I I I I I I I I I I M 
Db 1081 CACTCCAGATGTTGGCTGGGCCTTGAACCAGCCAGGAGCCTTTCTGGAATAGCCTCGGTC 1140 

Qy 1141 AACCCAGGGCCAGCGTTATGGGTCTCCGGAAGCCCGAGTAACTTCCGGAGATGCTGGTGG 1200 

I I N I I M I I I I I I I I I I I M I I M I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 1141 AACCCAGGGCCAGCGTTATGGGTCTCCGGAAGCCCCAGTAACTTCCGGAGATGCTGGTGG 1200 

Qy 1201 TCCTGTAGCAGGCTGGACACTTATTTCAAGAGTGATTCTAATTGGCTGGACTCAGAGGAA 1260 

I I I I I I I I I I I I 1 I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I 
Db 1201 TCCTGTAGCAGGCTGGACACTTATTTCAAGAGTGATTCTAATTGGCTGGACTCAGAGGAA 1260 

Qy 1261 ACCCTGCAG 1269 

I I I I I I I I I 
Db 1261 ACCCTGCAG 1269 

RESULT 5 
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i^^vyxvu, ocoioii xvc:>uiLd xy^uuid lui /-\.ppii\^atiuii v:7o^^z,uo oiiu ocai\^ii xvcauii ua-v7:7-o^^.., A'a^c u ux 



AD058598 

ID AD058598 standard; DNA; 3315 BP. 
XX 

AC AD058598; 
XX 

DT 15-JUL-2004 (first entry) 
XX 

DE Porcine alpha-1, 2-fucosyltransf erase (FUTl) DNA. 
XX 

KW Detection method; resistance gene; porcine; epidemic diarrhoea; 

KW Escherichia coli F18 receptor; alpha-1; 2-fucosyltransf erase; FUTl; pig; 

KW ds. 

XX 

OS Sus scrofa. 
XX 

PN KR2002080504-A. 
XX 

PD 26-OCT-2002. 
XX 

PF ll-APR-2001; 2001KR-00019282 . 
XX 

PR ll-APR-2001; 2001KR-00019282 . 
XX 

PA (CHOK/) CHO K K. 

PA (CHOI/) CHOI Y J. 

PA (LEES/) LEE S H. 
XX 

PI Cho KK, Choi YJ, Kim CU, Lee SH; 
XX 

DR WPI; 2003-325954/31. 
XX 

PT Detection of resistance gene against porcine epidemic diarrhea. 
XX 

PS Disclosure; Fig 1; 12pp; Korean. 
XX 



CC The present invention relates to a method of detection of a resistance 

CC gene against porcine epidemic diarrhoea by examining resistance and 

CC susceptibility against Escherichia coli F18 receptor. The detection 

CC involves preparing primers for detecting a mutation of the FUTl gene, and 

CC detecting a pig, which shows resistance against porcine epidemic 

CC diarrhoea, by PGR using the prepared primers and a test sample collected 

CC from a pig. Also disclosed are sequences of the primers used for the 

CC detection. The present sequence represents porcine FUTl DNA. 

XX 

SQ Sequence 3315 BP; 648 A; 991 C; 901 G; 775 T; 0 U; 0 Other; 



Query Match 98.2%; Score 1245.8; DB 11; Length 3315; 

Best Local Similarity 99.4%; Pred. No. 1.2e-279; 

Matches 1261; Conservative 0; Mismatches 7; Indels 1; Gaps 1; 

Qy 1 CTCGAGCCATGTGGGTCCCCAGCCGCCGCCACCTCTGTCTGACCTTCCTGCTAGTCTGTG 60 

II I I I II II I I I I I II I I I II II I I I I I I I I I I I I II I I 11 I I II I I I I II II I I I I I 
Db 601 CTGCAGCCATGTGGGTCCCCAGCCGCCGCCACCTCTGTCTGACCTTCCTGCTAGTCTGTG 660 

Qy 61 TTTTAGCAGCAATTTTCTTCCTGAACGTCTATCAAGACCTCTTTTACAGTGGCTTAGACC 120 

M II I I I I II I I I I I I I I I M I I I M I I i I I I I I M I I I I I I I I I I I I I I I I M I I I I I I 

Db 661 TTTTAGCAGCAATTTTCTTCCTGAACGTCTATCAAGACCTCTTTTACAGTGGCTTAGACC 720 

Qy 121 TGCTGGCCCTGTGTCCAGACCATAACGTGGTATCATCTCCCGTGGCCATATTCTGCCTGG 180 

I I I I I I I I I I I II I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 721 TGCTGGCCCTGTGTCCAGACCATAACGTGGTATCATCTCCCGTGGCCATATTCTGCCTGG 780 
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t^^^AVLi ocaiL/ii xvcdUitd x^CLaxid lui rvppin^auuii u:70Tnz.uo mm ocaivii xvc^duii ua-w:7-on^, 



Qy 


181 


CGGGCACGCCGGTACACCCCAACGCCTCCGATTCCTGTCCCAAGCATCCTGCCTCCTTTT 


240 






1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 III 




Db 


7ft 1 

/ O J. 


L-^j^jU^UAUtjL^utjtji AL-AUUUOAAUCjUt^ iUU i tjiuuOAAGCATCCTGCCTCCCTTT 


840 


Qy 


241 


CCGGGACCTGGACTATTTACCCGGATGGCCGGTTTGGGAACCAGATGGGACAGTATGCCA 


300 






1 1 1 1 II II II i 1 1 1 II 11 II 1 II 1 II 1 II 1 1 II 1 1 M II M 1 II 1 1 M 1 II M II M II 1 






O 4 X 


UOtj^:r(jAUUiC:rCjACiAi i IAUULCjCjAI GGCCGG 1 i iCirGGAACCAGATGGGACAGTATGCCA 


900 


Qy 


301 


CGCTGCTGGCCCTGGCGCAGCTCAACGGCCGCCAGGCCTTCATCCAGCCTGCCATGCACG 


360 






1 1 1 1 1 1 1 II 1 II 1 1 1 1 1 1 1 1 1 II 1 1 1 1 II 1 1 II 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 M II 1 






^ U X 




960 




o ox 


t^UGlOCTGGCCCCCGTGTTCCGCATCACGCTGCCTGTCCTGGCGCCCGAGGTAGACAGGC 


420 






1 1 1 1 1 1 1 1 1 II M 1 1 1 1 M 1 1 1 1 1 M 1 1 1 1 M 1 1 1 1 1 1 1 M 1 M 1 1 1 1 1 II M 1 M 1 1 1 1 




JJJU 


y DX 


CCGTCCTGGCCCCCGTGTTCCGCATCACGCTGCCTGTCCTGGCGCCCGAGGTAGACAGGC 


1020 


Qy 


421 


ACGCTCCTTGGCGGGAGCTGGAGCTTCACGACTGGATGTCCGAGGATTATGCCCACTTAA 
1 t 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 I r 1 t I t 1 1 1 1 1 1 1 t 1 1 1 


480 


UD 


XUZ X 


i f 1 1 1 1 t 1 1 1 1 1 1 1 1 1 1 [ M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 t M 1 

ACGCTCCTTGGCGGGAGCTGGAGCTTCACGACTGGATGTCCGAGGATTATGCCCACTTAA 1080 


Qy 


481 


AGGAGCCCTGGCTGAAGCTCACCGGCTTCCCCTGCTCCTGGACCTTCTTCCACCACCTCC 


540 






M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II M 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 M 1 1 M 1 1 1 1 1 1 1 1 M 1 1 1 1 M 




JJJL) 


1 HQ 1 
XUoX 


AGGAGCCCTGGCTGAAGCTCACCGGCTTCCCCTGCTCCTGGACCTTCTTCCACCACCTCC 


1140 


Qy 


541 


GGGAGCAGATCCGCAGCGAGTTCACCCTGCACGACCACCTTCGGCAAGAGGCCCAGGGGG 


600 






M 1 1 1 1 1 1 1 1 1 1 1 11 1 II 1 1 1 1 1 1 II M 1 1 M II 1 1 1 1 1 M 1 1 1 M 1 1 1 1 1 M 11 1 M II 




DD 


11/11 

1X41 


GGGAGCAGATCCGCAGCGAGTTCACCCTGCACGACCACCTTCGGCAAGAGGCCCAGGGGG 


1200 


Qy 


601 


TACTGAGTCAGTTCCGTCTACCCCGCACAGGGGACCGCCCCAGCACCTTCGTGGGGGTCC 


660 






1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 II M 1 1 1 1 1 1 M 1 1 M 1 1 II 1 II M 1 1 1 1 1 1 1 M M 1 1 1 






izUl 


TACTGAGTCAGTTCCGTCTACCCCGCACAGGGGACCGCCCCAGCACCTTCGTGGGGGTCC 


1260 


Qy 


661 


ACGTGCGCCGCGGGGACTATCTGCGTGTGATGCCCAAGCGCTGGAAGGGGGTGGTGGGTG 


720 






1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 I 1 1 1 1 1 t 1 1 1 1 1 1 1 1 I I 1 1 I 1 I 1 

1 1 1 M 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 M M 1 1 1 1 1 1 M II 1 1 1 1 1 1 1 1 M 1 M 1 1 1 




JJJL) 


IZ ol 


AUGiGUGUQ^GCGGGGAL-iAlCiGCGTGTGATGCCCAAGCGCTGGAAGGGGGTGGTGGGTG 


1320 


Qy 


721 


ACGGCCGTTACCTCCAGCAGGCTATGGACTGGTTCCGGGCCCGATACGAAGCCCCCGTCT 


780 






1 IN 1 1 1 1 M 1 1 M 1 1 1 1 1 1 M M 1 1 1 M 1 1 M 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 






X X 


ATGGCGCTTACCTCCAGCAGGCTATGGACTGGTTCCGGGCCCGATACGAAGCCCCCGTCT 


1380 


Qy 


781 


TTGTGGTGACCAGCAACGGCATGGAGTGGTGCCGGAAGAACATCGACACCTCCCGGGGGG 


840 






1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 i 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 




ijjj 


X o 0 X 


TTGTGGTCACCAGCAACGGCATGGAGTGGTGCCGGAAGAACATCGACACCTCCCGGGGGG 


1440 


Qy 


841 


ACGTGATCTTTGCTGGCGATGGGCGGGAGGCCGCGCCCGCCAGGGACTTTGCGCTGCTGG 


900 






i 1 1 1 1 1 1 1 M 1 1 M 1 1 1 1 M 1 1 1 1 II II 1 M II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 






1 d 4 1 
X 4 4 X 


ACGTGATCTTTGCTGGCGATGGGCGGGAGGCCGCGCCCGCCAGGGACTTTGCGCTGCTGG 


1500 


Qy • 


901 


TGCAGTGCAACCACACCATCATGACCATTGGCACCTTCGGCTTCTGGGCCGCCTACCTGG 


960 






1 1 1 1 1 1 1 1 M M M 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 




LlkJ 


X ^ U X 


TGCAGTGCAACCACACCATCATGACCATTGGCAGCTTCGGCTTCTGGGCCGCCTACCTGG 


1560 


Qy 


961 


CTGGTGGAGATACCATCTACTTGGCTAACTTCACCCTGCCCACTTCCAGCTTCCTGAAGA 


1020 






1 1 1 N M II 1 1 11 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 M 1 1 1 1 1 1 1 t 1 M 




Db 


1561 


CTGGTGGAGATACCATCTACTTGGCTAACTTCACCCTGCCCACTTCCAGCTTCCTGAAGA 


1620 


Qy 


1021 


TCTTTAAACCCGAGGCTGCCTTCCTGCCCGAGTGGGTGGGCATTAATGCAGACTTGTCTC 


1080 






MMIMIMIMMIIIII 1 1 1 1 1 1 1 1 M 1 M 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 




Db 


1621 


TCTTTAAACCCGAGGCTGCCTTCCTGCCCGAGTGGGTGGGCATTAATGCAGACTTGTCTC 


1680 
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Qy 


1081 


Db 


1681 


Qy 


1141 


Db 


1741 


Qy 


1201 


Db 


1801 


Qy 


1261 


Db 


1860 



ocoiL^u ivcauita i^cuuia lui rvppuoauuii v^^o-rn^uo cuiu ocan^ii ivcauit ua-^^-on^... ra^c 1^ ui -tu 

CACTCCAGATGTTGGCTGGGCCTTGAACCAGCCAGGAGCCTTTCTGGAATAGCCTCGGTC 1140 

I N I I I I I I I I I I I I I I I I I I I I i IIIIIMIIIIIIIIIIIIM INI 

CACTCCAGATGTTGGCTGGGCCTTGAACCAGCCAGGAGCCTTTCTGGAATAGCCTCGGTC 1740 

AACCCAGGGCCAGCGTTATGGGTCTCCGGAAGCCCGAGTAACTTCCGGAGATGCTGGTGG 1200 

llllllll Illi IIIIIMIIII I I I I I I I I I I I I I I I M I I I I I I I 

AACCCAGGGCCAGCGTTATGGGTCTCCGGAAGCCCCAGTAACTTCCGGAGATGCTGGTGG 1800 

TCCTGTAGCAGGCTGGACACTTATTTCAAGAGTGATTCTAATTGGCTGGACTCAGAGGTIA 12 60 

IIINIIIII IMIIMIMMIIIIIIIIIIIIIIIIIIIIIIIMlliliMIIIII 

TCC TGTAGCA- GCTGGACAC TTAT TTCAAGAGTGAT TC TAATTGGC TGGAC TC AGAGGAA 1859 

ACCCTGCAG 12 69 
I M I I I I I I 



RESULT 6 
AAV21640 

ID AAV21640 standard; cDNA; 1098 BP. 
XX 

AC AAV21640; 
XX 

DT 08-JUL-1998 (first entry) 
XX 

DE Pig H transferase encoding cDNA. 
XX 

KW Pig; H transferase; chimeric; glycosyltransf erase; gene therapy; 

KW transplantation; ss. 

XX 

OS Sus scrofa. 
XX 

FH Key Location/Qualifiers 
FT CDS 1. .1098 

FT /*tag= a 

FT /product= "H transferase" 

XX 

PN WO9805768-A1. 
XX 

PD 12-FEB-1998. 
XX 

PF Ol-AUG-1997; 97WO-AU0004 92 . 
XX 

PR 02-AUG-1996; 96AU-00001402 . 
PR 21-AUG-1996; 96US-0024279P . 
XX 

PA (AUST-) AUSTIN RES INST. 
XX 

PI Mckenzie IFC, Sandrin MS; 
XX 

DR WPI; 1998-159170/14. 
DR P-PSDB; AAW53102. 
XX 

PT Nucleic acids encoding chimeric glycosyltransf erases - used for altering 
PT carbohydrate levels on the surface of cells, useful in gene therapy and 
PT transplantation. 
XX 

PS Example 3; Fig 7; 51pp; English. 
XX 
CC 



The present sequence encodes pig H transferase used in an example of the 
CC present invention. The present invention describes nucleic acids (NA) 
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^^y-^i-"^^ ocoiwi ±vc;auiL:> x^^iaiL:> rx^jyii^ausjii vj7o*m^uo aiiu ocaii^ii xvc^duit ua-vy:7-o^*^... A a^c lu ui 



CC encoding a chimeric glycosyltransf erase . The NAs comprise a catalytic 

CC domain of a first glycosyltransf erase (GT) and a localisation signal of a 

CC second GT, whereby when the NA is expressed in a cell and where the 

CC chimeric enzyme is located in an area of the cell where it is able to 

CC compete for substrate with a second GT, resulting in reduced levels of a 

CC product from the second GT. The NAs can be used to produce cells and 

CC organs with desired glycosylation patterns. Products and methods of the 

CC present invention can be used to reduce the levels of undesirable 

CC epitopes in cells, tissues or organs which may be used in transplantation 

CC or gene therapy 

XX 

SQ Sequence 1098 BP; 181 A; 367 C; 312 G; 238 T; 0 U; 0 Others- 
Query Match 86.3%; Score 10 94.8; DB 2; Length 1098; 
Best Local Similarity 99.8%; Pred. No. 1.2e-244; 

Matches 1096; Conservative 0; Mismatches 2; Indels 0; Gaps 0; 
Qy 9 ATGTGGGTCCCCAGCCGCCGCCACCTCTGTCTGACCTTCCTGCTAGTCTGTGTTTTAGCA 68 

I I I I M I I M I I t I I I I I M I M I I I I I I M I I I I I I I I I I I I I I I I I I I I I I M I I I I I 

Db 1 ATGTGGGTCCCCAGCCGCCGCCACCTCTGTCTGACCTTCCTGCTAGTCTGTGTTTTAGCA 60 

Qy 69 GCAATTTTCTTCCTGAACGTCTATCAAGACCTCTTTTACAGTGGCTTAGACCTGCTGGCC 128 

I I I I I I I M M I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I 
Db 61 GCAATTTTCTTCCTGAACGTCTATCAAGACCTCTTTTACAGTGGCTTAGACCTGCTGGCC 120 

Qy 129 CTGTGTCCAGACCATAACGTGGTATCATCTCCCGTGGCCATATTCTGCCTGGCGGGCACG 188 

I I I I M I M I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I IN I I I I 
Db 121 CTGTGTCCAGACCATAACGTGGTATCATCTCCCGTGGCCATATTCTGCCTGGCGGGCACG 180 

Qy 189 CCGGTACACCCCAACGCCTCCGATTCCTGTCCCAAGCATCCTGCCTCCTTTTCCGGGACC 248 

I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 181 CCGGTACACCCCAACGCCTCCGATTCCTGTCCCAAGCATCCTGCCTCCTTTTCCGGGACC 240 

Qy 24 9 TGGACTATTTACCCGGATGGCCGGTTTGGGAACCAGATGGGACAGTATGCCACGCTGCTG 308 

I I I I I I M M I I I I I I I I I I I I I I I I 1 I I I I I I I I I I I I I I I I I M I M I t I I I I I I I I I 
Db 241 TGGACTATTTACCCGGATGGCCGGTTTGGGAACCAGATGGGACAGTATGCCACGCTGCTG 300 

Qy 309 GCCCTGGCGCAGCTCAACGGCCGCCAGGCCTTCATCCAGCCTGCCATGCACGCCGTCCTG 368 

I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 301 GCCCTGGCGCAGCTCAACGGCCGCCAGGCCTTCATCCAGCCTGCCATGCACGCCGTCCTG 360 

Qy 369 GCCCCCGTGTTCCGCATCACGCTGCCTGTCCTGGCGCCCGAGGTAGACAGGCACGCTCCT 428 

I I I I I I I i I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 361 GCCCCCGTGTTCCGCATCACGCTGCCTGTCCTGGCGCCCGAGGTAGACAGGCACGCTCCT 420 

Qy 429 TGGCGGGAGCTGGAGCTTCACGACTGGATGTCCGAGGATTATGCCCACTTAAAGGAGCCC 488 

i M I I I M I I M I I I M I I I I I I I I I I I I I M I I I I I I I I I I I I I I I M I I I i I I I I I M 

Db 421 TGGCGGGAGCTGGAGCTTCACGACTGGATGTCCGAGGATTATGCCCACTTAAAGGAGCCC 480 

Qy 489 TGGCTGAAGCTCACCGGCTTCCCCTGCTCCTGGACCTTCTTCCACCACCTCCGGGAGCAG 548 

M I M I I M I I I I I I I I I I I I I I I I I I I I I I n I I I I I I I I I I M I I I I I I I I I I I I I I I 
Db 481 TGGCTGAAGCTCACCGGCTTCCCCTGCTCCTGGACCTTCTTCCACCACCTCCGGGAGCAG 540 

Qy 549 ATCCGCAGCGAGTTCACCCTGCACGACCACCTTCGGCAAGAGGCCCAGGGGGTACTGAGT 608 

IIIMIII ilMIIIIIIII I I I I I I I I I I I I I I I MINI 

Db 541 ATCCGCAGCGAGTTCACCCTGCACGACCACCTTCGGCAAGAGGCCCAGGGGGTACTGAGT 600 

Qy 609 CAGTTCCGTCTACCCCGCACAGGGGACCGCCCCAGCACCTTCGTGGGGGTCCACGTGCGC 668 

I i M II I M I I I II I I I I I I I I I I I I I I I I II I II I I I I I I I I I I I I I I 

Db 601 CAGTTCCGTCTACCCCGCACAGGGGACCGCCCCAGCACCTTCGTGGGGGTCCACGTGCGC 660 
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Qy 


669 


CGCGGGGACTATCTGCGTGTGATGCCCAAGCGCTGGAAGGGGGTGGTGGGTGACGGCCGT 
1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 

CGCGGGGACTATCTGCGTGTGATGCCCAAGCGCTGGAAGGGGGTGGTGGGTGACGGCGCT 


728 


Db 


661 


720 


Qy 


729 


TACCTCCAGCAGGCTATGGACTGGTTCCGGGCCCGATACGAAGCCCCCGTCTTTGTGGTC 

1 1 N 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 M 1 1 1 1 M 1 1 

TACCTCCAGCAGGCTATGGACTGGTTCCGGGCCCGATACGAAGCCCCCGTCTTTGTGGTC 


788 


Db 


721 


780 


Qy 

Db 


789 
781 


ACCAGCAACGGCATGGAGTGGTGCCGGAAGAACATCGACACCTCCCGGGGGGACGTGATC 

NIIIIIIIMIIIIIIIIIIIIIIMIIMM 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

ACCAGCAACGGCATGGAGTGGTGCCGGAAGAACATCGACACCTCCCGGGGGGACGTGATC 


848 
840 


Qy 


849 


TTTGCTGGCGATGGGCGGGAGGCCGCGCCCGCCAGGGACTTTGCGCTGCTGGTGCAGTGC 
1 1 1 1 1 N 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 
TTTGCTGGCGATGGGCGGGAGGCCGCGCCCGCCAGGGACTTTGCGCTGCTGGTGCAGTGC 


908 


Db 


841 


900 


Qy 


909 


AACCACACCATCATGACCATTGGCACCTTCGGCTTCTGGGCCGCCTACCTGGCTGGTGGA 

N M 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 M 1 

AACCACACCATCATGACCATTGGCACCTTCGGCTTCTGGGCCGCCTACCTGGCTGGTGGA 


968 


Db 


901 


960 


Qy 


969 


GATACCATCTACTTGGCTAACTTCACCCTGCCCACTTCCAGCTTCCTGAAGATCTTTAAA 
1 1 1 > 1 1 1 1 1 M t 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 i 1 1 
GATACCATCTACTTGGCTAACTTCACCCTGCCCACTTCCAGCTTCCTGAAGATCTTTAAA 


1028 


Db 


961 


1020 


Qy 


1029 


CCCGAGGCTGCCTTCCTGCCCGAGTGGGTGGGCATTAATGCAGACTTGTCTCCACTCCAG 
N 1 1 1 1 1 1 M 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

CCCGAGGCTGCCTTCCTGCCCGAGTGGGTGGGCATTAATGCAGACTTGTCTCCACTCCAG 


1088 


Db 


1021 


1080 


Qy 


1089 


ATGTTGGCTGGGCCTTGA 1106 
1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 t 
ATGTTGGCTGGGCCTTGA 1098 




Db 


1081 





GDP-Fuc:beta-D-galactoside alpha (1, 2 ) -fucosyltransf erase gene. 



RESULT 7 
AAQ13332 

ID AAQ13332 standard; DNA; 8174 BP. 
XX 

AC AAQ13332; 
XX 

DT 25-MAR-2003 (revised) 
DT 07-NOV-1991 (first entry) 
XX 
DE 
XX 

KW Glycosyltransf erase. 
XX 

OS Homo sapiens . 
XX 

FH Key Location/Qualifiers 
FT CDS 4686. .5783 

FT /*tag= a 

XX 

FN WO9112340-A. 
XX 

PD 22-AUG-1991. 
XX 

PF 14-FEB-1990; 90US-00479858 . 
XX 

PR 14-FEB-1990; 90US-00479858 . 
PR 14-FEB-1990; 90US-00480133 . 
PR 12-DEC-1990.; 90US-00627621 . 
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tJ^^Avij, ocai\^ii xvcduiLd j^cioiid lui rrppiicauuii v:7o*t^z-uo oiiu oc^muii ivcauii. ua-u:7-o^^... I'a^c lo 



XX 

PA (UNMI ) UNIV MICHIGAN. 
XX 

PI Lowe JB; 
XX 

DR WPI; 1991-267151/36. 

DR P-PSDB; AAR13751. 
XX 

PT Isolation of gene conveying post-translational characteristic - e.g. the 

PT presence of soluble or membrane bound oligo or polysaccharide o.r 

PT glycosyltransf erase . 
XX 

PS Disclosure; Fig 3; 155pp; English. 
XX 

CC The DNA encodes a protein sequence capable of functioning as a GDP- 

CC Fuc: [beta-D-Gal alpha(l,2)- fucosyltransf erase . The sequence coded by 

CC nucleotides 4782 -.5780 represents the functional protein. The enzyme 

CC produced by the DNA sequence can be used in enzymatic fucosylation of 

CC chain-terminating galactose residues on lactose- amine or neolacto type 

CC beta-D-galactoside to alpha-2-L-fucose residues. See also AAQ13330- 

CC Q13333. (Updated on 25-MAR-2003 to correct PA field.) 
XX 

SQ Sequence 8174 BP; 1628 A; 2229 C; 2322 G; 1995 T; 0 U; 0 Other; 



Query Match 63.7%; Score 808.6; DB 2; Length 8174; 

Best Local Similarity 79.6%; Pred. No. 7.7e-178; 

Matches 993; Conservative 0; Mismatches 249; Indels 5; Gaps 3; 

Qy 1 CTCGAGCCATGTGGGTCCCCAGCCGCCGCCACCTCTGTCTGACCTTCCTGCTAGTCTGTG 60 

II I I I I I I I I II III II I I II II I I I I I III II I I I I I I I I I II I I M I 
Db 4678 CTGCAGCCATGTGGCTCCGGAGCCATCGTCAGCTCTGCCTGGCCTTCCTGCTAGTCTGTG 4737 

Qy 61 TTTTAGCAGCT^TTTTCTTCCTGAACGTCTATCAAGACCTCTTTTACAGTGGCTTAGACC 120 

I I I I I I I I I I I I I I I I I I I I M I I I I I I II I I M I I I M 

Db 4738 TCCTCTCTGTAATCTTCTTCCTCCATATCCATCAAGACAGCTTTCCACATGGCCTAGGCC 4797 



Qy 121 TGCTGGCGCTGTGTCCAGACCATAACGTGGTATCATCTCCCGTGGCCATATTCTGCCTGG 180 

M I I I I I I I I I I I I I I I I I I I I I I I I i I I t I I I I I I I I M I I I I 

Db 4798 TGTCGATCCTGTGTCCAGACCGCCGCCTGGTGACACCCCCAGTGGCCATCTTCTGCCTGC 4857 

Qy 181 CGGGCACGCCGGTACACCCCAACGCCTCCGATTCCTGTCCCAAGCATCCTGCCTCCTTTT 240 

t I I I I I III I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I 
Db 4858 CGGGTACTGCGATGGGCCCCAACGCCTCCTCTTCCTGTCCCCAGCACCCTGCTTCCCTCT 4917 



Qy 


241 


CCGGGACCTGGACTATTTACCCGGATGGCCGGTTTGGGAACCAGATGGGACAGTATGCCA 

MM 1 II 1 1 1 1 M 1 1 II 1 1 M 1 II II 1 M II 1 II 1 1 II M 1 II 1 M M II M 1 

CCGGCACCTGGACTGTCTACCCCAATGGCCGGTTTGGTAATCAGATGGGACAGTATGCCA 


300 


Db 


4918 


4977 


Qy 


301 


CGCTGCTGGCCCTGGCGCAGCTCAACGGCCGCCAGGCCTTCATCCAGCCTGCCATGCACG 
M II 1 1 1 1 II Mill 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 
CGCTGCTGGCTCTGGCCCAGCTCAACGGCCGCCGGGCCTTTATCCTGCCTGCCATGCATG 


360 


Db 


4978 


5037 



Qy 


361 


CCGTCCTGGCCCCCGTGTTCCGCATCACGCTGCCTGTCCTGGCGCCCGAGGTAGACAGGC 


420 






1 II 1 1 1 1 1 1 1 M II M M 1 1 1 1 II 1 1 1 1 1 1 II 11 1 1 1 M II II 1 1 II 1 1 




Db 


5038 


CCGCCCTGGCCCCGGTATTCCGCATCACCCTGCCCGTGCTGGCCCCAGAAGTGGACAGCC 


5097 


Qy 


421 


ACGCTCCTTGGCGGGAGCTGGAGCTTCACGACTGGATGTCCGAGGATTATGCCCACTTAA 


480 






1 1 M II 1 1 1 1 1 II 1 1 1 1 1 1 1 1 M 1 1 1 II II 1 II II 1 1 1 1 1 II II 1 1 1 1 1 




Db 


5098 


GCACGCCGTGGCGGGAGCTGCAGCTTCACGACTGGATGTCGGAGGAGTACGCGGACTTGA 


5157 


Qy 


481 


AGGAGCCCTGGCTGAAGCTCACCGGCTTCCCCTGCTCCTGGACCTTCTTCCACCACCTCC 


540 
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Db 


5158 


II II I 1 il 1 IIMMIIIIIIII Mill llllllllll 

GAGATCCTTTCCTGAAGCTCTCTGGCTTCCCCTGCTCTTGGACTTTCTTCCACCATCTCC 


5217 


Qy 


541 


GGGAGCAGATCCGCAGCGAGTTCACCCTGCACGACCACCTTCGGCAAGAGGCCCAGGGGG 

MM IIIIIIIIMI MIIIMIIIIIIIIIIII Ml lllllll Ml 1 1 

GGGAACAGATCCGCAGAGAGTTCACCCTGCACGACCACCTTCGGGAAGAGGCGCAGAGTG 


600 


Db 


5218 


5277 


Qy 


601 


TACTGAGTCAGTTCCGTCTACCCCGCACAGGGGACCGCCCCAGCACCTTCGTGGGGGTCC 

I 1 II 1 II II II 1 1 1 1 1 II II II 1 1 1 1 1 1 1 1 1 II 1 1 1 M II II II 1 1 1 1 

TGCTGGGTCAGCTCCGCCTGGGCCGCACAGGGGACCGCCCGCGCACCTTTGTCGGCGTCC 


660 


Db 


5278 


5337 


Qy 


661 


ACGTGCGCCGCGGGGACTATCTGCGTGTGATGCCCAAGCGCTGGAAGGGGGTGGTGGGTG 

N II 1 1 M II II II M M M M 1 M 1 II 1 1 II II M 1 II 1 II 1 1 M 1 II M 1 

ACGTGCGCCGTGGGGACTATCTGCAGGTTATGCCTCAGCGCTGGAAGGGTGTGGTGGGCG 


720 


Db 


5338 


5397 


Qy 


721 


ACGGCCGTTACCTCCAGCAGGCTATGGACTGGTTCCGGGCCCGATACGAAGCCCCCGTCT 
II II lllllll 1 M 1 II II 1 1 1 1 1 II II II 1 1 1 1 II II 1 1 II 1 II II 1 1 1 
ACAGCGCCTACCTCCGGCAGGCCATGGACTGGTTCCGGGCACGGCACGAAGCCCCCGTTT 


780 


Db 


5398 


5457 


Qy 


781 


TTGTGGTCACCAGCAACGGCATGGAGTGGTGCCGGAAGAACATCGACACCTCCCGGGGGG 

1 M M II II 1 1 1 1 1 II II 1 1 II 1 1 1 M II 1 M 1 M III 1 

TCGTGGTCACCAGCAACGGCATGGAGTGGTGTAAAGAAAACATCGACACCTCCCAGGGCG 


840 


Db 


5458 


5517 


Qy 


841 


ACGTGATCTTTGCTGGCGATGGGCGGGAGGCCGCGCCCGCCAGGGACTTTGCGCTGCTGG 
INN II M II 1 II 1 II II 1 II II II III 1 II II II M 1 II II 
ATGTGACGTTTGCTGGCGATGGACAGGAGGCTACACCGTGGAAAGACTTTGCCCTGCTCA 


900 


Db 


5518 


5577 


Qy 


901 


TGCAGTGCAACCACACCATCATGACCATTGGCACCTTCGGCTTCTGGGCCGCCTACCTGG 

INIINMMMMM II II II 1 1 II II 1 1 1 1 II II II 1 1 1 II 1 1 llllllllll 

CACAGTGCAACCACACCATTATGACCATTGGCACCTTCGGCTTCTGGGCTGCCTACCTGG 


960 


Db 


5578 


5637 


Qy 


961 


CTGGTGGAGATACCATCTACTTGGCTAACTTCACCCTGCCCACTTCCAGCTTCCTGAAGA 

i N 1 M M 1 II M II 1 II M 1 1 1 II II 1 1 1 II II II II II i 1 1 M 1 

CTGGCGGAGACACTGTCTACCTGGCCAACTTCACCCTGCCAGACTCTGAGTTCCTGAAGA 


1020 


Db 


5638 


5697 


Qy 


1021 


TCTTTAAACCCGAGGCTGCCTTCCTGCCCGAGTGGGTGGGCATTAATGCAGACTTGTCTC 

lllllll II INN lllllll II II 1 1 1 II II II 1 1 1 II II II II II 1 II 1 1 

TCTTTAAGCCGGAGGCGGCCTTCCTGCCCGAGTGGGTGGGCATTAATGCAGACTTGTCTC 


1080 


Db 


5698 


5757 


Qy 


1081 


CACTCCAGATGTTGGCTGGGCCTTGAACCAGCCAGGAGCCTTTCTGGAATAGCCTCGGTC 

1 > t M II Mini lllllll II 1 1 M 1 II III 1 III 

CACTCTGGACATTGGCTAAGCCTTGA— GAGCCAGGGAGACTTTCTGAAGTAGCCTGATC 


1140 


Db 


5758 


5815 


Qy 


1141 


AACCCAGGGCCAGCGTTATGGGTCTCCGGAAGCCCGAGTAACTTCCGGAGATGCTGGTGG 
1 II 1 1 1 N 1 II 1 1 II 1 II Ml 1 1 1 II 1 1 1 1 II 1 III II II 
TTTCTAGAGCCAGCAGTACGTGGCTTCAGAGGCCTG-GCATCTTCTGGAGAAGCTTGTGG 


1200 


Db 


5816 


5874 


Qy 


1201 


TCCTGTAGCAG— GCTGGACACTTATTTCAAGAGTGATTCTAATTGG 1245 
1 1 IN III 1 1 1 1 M II II 1 II 1 M 1 II 1 
TGTTCCTGAAGCAAATGGGTGCCCGTATCCAGAGTGATTCTAGTTGG 5921 




Db 


5875 





RESULT 8 
AAT61677 

ID AAT61677 standard; DNA; 8174 BP. 
XX 

AC AAT61677; 
XX 

DT 19-JUN-1997 (first entry) 
XX 

DE Human alpha ( 1, 2 ) -fucosyltransf erase DNA. 
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ov^v^Avij, ocoiv^u xvcaulua A^cuuia iui j-i.yyn^aiiKJU v:70-rn^uo oiiu ocaic^u ivcauiL ua-vj^-o-rn,.. ra^c 
XX 

KW Alpha (1, 2) -fucosyltransferase; glycosylation; oligosaccharide; 

KW blood group H; ss. 

XX 

OS Homo sapiens . 
XX 

FH Key Location/Qualifiers 

FT CDS 4686. .5783 

FT /*tag= a 
XX 

PN WO9709421-A1. 
XX 

PD 13-MAR-1997. 
XX 

PF 06-SEP-1996; 96WO-US013816 . 
XX 

PR 08-SEP-1995; 95US-00525058 . 
XX 

PA (UNMI ) UNIV MICHIGAN. 
XX 

PI Lowe JB, Legault DJ; 
XX 

DR WPI; 1997-192897/17. 

DR P-PSDB; AAW1364 0. 
XX 

PT New recombinant fucosyl : transferase proteins - useful for modifying cell 

PT surface oligosaccharide structures. 

XX 

PS Example 1; Page 274-279; 329pp; English. 
XX 

CO A DNA sequence (AAT61677) codes for human GDP-Fuc :beta-D- galactoside 

CC alpha (1,2) -fucosyltransferase (AAW13640) , an enzyme involved in the 

CC expression of type I and II blood group H structures. It was obtd. by 

CC trans fee ting mouse L cells with DNA derived from human A431 cells, and 

CC selection of transf ectants that expressed the H antigen by using anti-H 

CC antibody in a panning procedure. The DNA can be used to construct animal 

CC cell lines with specific capabilities with respect to post-translational 

CC modification of the oligosaccharides of expressed proteins or lipids, or 

CC to produce recombinant enzyme for use in oligosaccharide prodn 
XX 

SQ Sequence 8174 BP; 1628 A; 2229 C; 2322 G; 1995 T; 0 U; 0 Other; 



Query Match 63.7%; Score 808.6; DB 2 

Best Local Similarity 79.6%; Pred. No. 7.7e-178 
Matches 993; Conservative 0; Mismatches 249 



Length 8174; 

Indels 5; Gaps 3; 



Qy 


1 


Db 


4678 


Qy 


61 


Db 


4738 


Qy 


121 


Db 


4798 


Qy 


181 


Db 


4858 



CTCGAGCCATGTGGGTCCCCAGCCGCCGCCACCTCTGTCTGACCTTCCTGCTAGTCTGTG 60 
II M 11 I I I I I I I I i I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 



I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
TCCTCTCTGTAATCTTCTTCCTCCATATCCATCAAGACAGCTTTCCACATGGCCTAGGCC 

TGCTGGCCCTGTGTCCAGACCATAACGTGGTATCATCTCCCGTGGCCATATTCTGCCTGG 

II I IMIIIIIIIIIII II I II lllllll nil 

TGTCGATCCTGTGTCCAGACCGCCGCCTGGTGACACCCCCAGTGGCCATCTTCTGCCTGC 

CGGGCACGCCGGTACACCCCAACGCCTCCGATTCCTGTCCCAAGCATCCTGCCTCCTTTT 
I I I I I I III I I I I I I I I I I M I I I I I I I I I I I I I I I I I 11 I I I I I I 
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ov^vyxvij, ocaiv.li ivcauiia xyctoiia lui i-LppiiL.auuu u:70'i^-t^uo oiiu ocan^u ivcauit ua-v:7-o-r-r... x a^c wi -ru 



Qy 241 CCGGGACCTGGACTATTTACCCGGATGGCCGGTTTGGGAACCAGATGGGACAGTATGCCA 300 

Illlllll Illlllllllll II I I i I II M I I M I I II I II 

Db 4918 CCGGCACCTGGACTGTCTACCCCT^TGGCCGGTTTGGTAATCAGATGGGACAGTATGCCA 4977 

Qy 301 CGCTGCTGGCCCTGGCGCAGCTCAACGGCCGCCAGGCCTTCATCCAGCCTGCCATGCACG 360 

I I I I I M I I I Mill I I II I I M I I I I II M MINI I I I I I M M M I M I I I 
Db 4978 CGCTGCTGGCTCTGGCCCAGCTCAACGGCCGCCGGGCCTTTATCCTGCCTGCCATGCATG 5037 

Qy 361 CCGTCCTGGCCCCCGTGTTCCGCATCACGCTGCCTGTCCTGGCGCCCGAGGTAGACAGGC 420 

IN I I I I I I I I I II I I I I I I M I I I I I II I II I I I I I II M II I II I r I 

Db 5038 CCGCCCTGGCCCCGGTATTCCGCATCACCCTGCCCGTGCTGGCCCCAGAAGTGGACAGCC 5097 

Qy 421 ACGCTCCTTGGCGGGAGCTGGAGCTTCACGACTGGATGTCCGAGGATTATGCCCACTTAA 480 

I I II I 11 I II I I I I II I I II I I I I I I I I II II I I I I I I I I II II II I I I 
Db 5098 GCACGCCGTGGCGGGAGCTGCAGCTTCACGACTGGATGTCGGAGGAGTACGCGGACTTGA 5157 

Qy 481 AGGAGCCCTGGCTGAAGCTCACCGGCTTCCCCTGCTCCTGGACCTTCTTCCACCACCTCC 540 

II II I I I I II I II I I I I I I I I I I I I II I I I II I I I I I I I M I M I I I I I 

Db 5158 GAGATCCTTTCCTGAAGCTCTCTGGCTTCCCCTGCTCTTGGACTTTCTTCCACCATCTCC 5217 

Qy 541 GGGAGCAGATCCGCAGCGAGTTCACCCTGCACGACCACCTTCGGCAAGAGGCCCAGGGGG 600 

I I I I I I I I I I I I I I I I I I M I I I I I I I I M I I I I I I M II M I I I M I I III I I 

Db 5218 GGGAACAGATCCGCAGAGAGTTCACCCTGCACGACCACCTTCGGGAAGAGGCGCAGAGTG 5277 

Qy 601 TACTGAGTCAGTTCCGTCTACCCCGCACAGGGGACCGCCCCAGCACCTTCGTGGGGGTCC 660 

I III INN MM II II M II II II M II II M M I M II II II I I I I 

Db 5278 TGCTGGGTCAGCTCCGCCTGGGCCGCACAGGGGACCGCCCGCGCACCTTTGTCGGCGTCC 5337 

Qy 661 ACGTGCGCCGCGGGGACTATCTGCGTGTGATGCCCAAGCGCTGGAAGGGGGTGGTGGGTG 720 

I M I I II II II I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I 

Db 5338 ACGTGCGCCGTGGGGACTATCTGCAGGTTATGCCTCAGCGCTGGAAGGGTGTGGTGGGCG 5397 

Qy 721 ACGGCCGTTACCTCCAGCAGGCTATGGACTGGTTCCGGGCCCGATACGAAGCCCCCGTCT 780 

II M M II I I I I I I II I II I I I I I I I I I I I I I I I M I I II I I I I I I I I I I 

Db 5398 ACAGCGCCTACCTCCGGCAGGCCATGGACTGGTTCCGGGCACGGCACGAAGCCCCCGTTT 5457 

Qy 781 TTGTGGTCACCAGCAACGGCATGGAGTGGTGCCGGAAGAACATCGACACCTCCCGGGGGG 840 

I M M II I I I I I I M II I I I I I I I I I I I I I I II 1 I II Ill III I 

Db 5458 TCGTGGTCACCAGCAACGGCATGGAGTGGTGTAAAGAAAACATCGACACCTCCCAGGGCG 5517 

Qy 841 ACGTGATCTTTGCTGGCGATGGGCGGGAGGCCGCGCCCGCCAGGGACTTTGCGCTGCTGG 900 

I I M I I I M I II I I I I I I M III I I II I I 

Db 5518 ATGTGACGTTTGCTGGCGATGGACAGGAGGCTACACCGTGGAAAGACTTTGCCCTGCTCA 5577 

Qy 901 TGCAGTGCAACCACACCATCATGACCATTGGCACCTTCGGCTTCTGGGCCGCCTACCTGG 960 

M II II 11 I M I I I I II I M I I I I II II I M I M I I I I 

Db 5578 CACAGTGCAACCACACCATTATGACCATTGGCACCTTCGGCTTCTGGGCTGCCTACCTGG 5637 

Qy 961 CTGGTGGAGATACCATCTACTTGGCTAACTTCACCCTGCCCACTTCCAGCTTCCTGAAGA 1020 

I M I 11 t I I I I M II I I II I II I I I I I I I I II I I II I 11 I I II I II 

Db 5638 CTGGCGGAGACACTGTCTACCTGGCCAACTTCACCCTGCCAGACTCTGAGTTCCTGAAGA 5697 

Qy 1021 TCTTTAAACCCGAGGCTGCCTTCCTGCCCGAGTGGGTGGGCATTAATGCAGACTTGTCTC 1080 

I I I II I I M II II I M I II I I Ill M I I I I I I I I I I I II I I I M I 

Db 5698 TCTTTAAGCCGGAGGCGGCCTTCCTGCCCGAGTGGGTGGGCATTAATGCAGACTTGTCTC 5757 

Qy 1081 CACTCCAGATGTTGGCTGGGCCTTGAACCAGCCAGGAGCCTTTCTGGAATAGCCTCGGTC 1140 

Mill II II I I I I II II I M M I I I II • II III I III 

Db 5758 CACTCTGGACATTGGCTAAGCCTTGA— GAGCCAGGGAGACTTTCTGAAGTAGCCTGATC 5815 
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ov^vyxvj^ ocon^ii ivcauiia x^cuuia iwi r^yyuy^aiisjii v>^o-rn^uo oiiu ocaicii xvcauit ua-v>r-o^n.. . x a^c ^-t, ui nu 



Qy 1141 AACCCAGGGCCAGCGTTATGGGTCTCCGGAAGCCCGAGTAACTTCCGGAGATGCTGGTGG 1200 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I i I I I I 

Db 5816 TTTCTAGAGCCAGCAGTACGTGGCTTCAGAGGCCTG-GCATCTTCTGGAGAAGCTTGTGG 5874 

Qy 1201 TCCTGTAGCAG— GCTGGACACTTATTTCAAGAGTGATTCTAATTGG 1245 

I I III I I I I I I I M I I I I I I I I I I I I M 

Db 5875 TGTTCCTGAAGCAAATGGGTGCCCGTATCCAGAGTGATTCTAGTTGG 5921 



RESULT 9 
ACN37237/C 



ID ACN37237 standard; DNA; 20001 BP. 
XX 

AC ACN37237; 
XX 

DT 18-NOV-2004 (first entry) 
XX 

DE Human periodontal disease related gene FUTl SEQ ID NO: 147. 
XX 

KW periodontal disease; polymorphism; ds; human; gene; SNP; 

KW single nucleotide polymorphism. 

XX 

OS Homo sapiens . 
XX 

FH Key Location/Qualifiers 

FT misc_feature 222 

FT /*tag= a 

FT /standard_name= "Single nucleotide polymorphism" 

FT /note= "Variable nucleotide G,A" 

FT misc_feature 2465 

FT /*tag= b 

FT /standard_name= "Single nucleotide polymorphism" 

FT /note= "Variable nucleotide A, C" 

FT misc_feature 2466 

FT /*tag= c 

FT /standard_name= "Single nucleotide polymorphism" 

FT /note= "Variable nucleotide A,G" 

FT misc_feature 4 624 

FT /*tag= d 

FT /standard_name= "Single nucleotide polymorphism" 

FT /note= "Variable nucleotide G,A" 

FT misc_feature 5112 

FT /*tag= e 

FT /standard_name= "Single nucleotide polymorphism" 

FT /note= "Variable nucleotide A,G" 

FT misc_feature 6298 

FT /*tag= f 

FT /standard_name= "Single nucleotide polymorphism" 

FT /note= "Variable nucleotide T,C" 

FT misc_feature 697 6 

FT /*tag= g 

FT /standard_name= "Single nucleotide polymorphism" 

FT /note= "Variable nucleotide A, G" . 

FT misc_feature 8485 

FT /*tag= h 

FT /standard_name= "Single nucleotide polymorphism" 

FT /note= "Variable nucleotide T,C" 

FT misc_feature 9457 

FT /*tag- i 

FT /standard_name= "Single nucleotide polymorphism" 

FT /note= "Variable nucleotide deletion, CACACACA" 
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ov^vyxNXi ocon^ii xvcauiLs j^cLOiid lui -TLfipiivauuii v/:7o^-t^uo oiiu ocaicii xvcauiL u:>-\jy-o^-t. 



FT 

FT 

FT 

FT 

FT 

FT 

FT 

FT 

FT 

FT 

FT 

FT 

FT 

FT 

FT 

FT 

FT 

FT 

FT 

FT 

FT 

FT 

FT 

FT 

FT 

FT 

FT 

FT 

FT 

FT 

FT 

FT 

FT 

FT 

FT 

FT 

FT 

FT 

FT 

FT 

FT 

FT 

FT 

FT 

FT 

FT 

FT 

FT 

FT 

FT 

FT 

FT 

FT 

FT 

FT 

FT 

FT 

FT 

FT 

FT 

FT 



misc feature 



misc feature 



misc feature 



misc feature 



misc feature 



misc feature 



misc feature 



misc feature 



misc feature 



misc feature 



misc feature 



misc feature 



misc feature 



misc feature 



misc feature 



misc feature 



9945 

/*tag- j 

/standard_name= "Single nucleotide 
/note= "Variable nucleotide C,A" 
10001 
/*tag= k 

/standard_name= "Single nucleotide 
/note= "Variable nucleotide G,A" 
10397 
/*tag= 1 

/standard_name= "Single nucleotide 
/note= "Variable nucleotide A, G" 
10821 
/*tag= m 

/standard_name= "Single nucleotide 
/note= "Variable nucleotide A, T" 
10944 
/*tag= n 

/standard_name= "Single nucleotide 
/note= "Variable nucleotide T,C" 
11507 
/*tag= o 

/standard_name= "Single nucleotide 
/note= "Variable nucleotide A,G" 
11635 
/*tag= p 

/standard_name= "Single nucleotide 
/note= "Variable nucleotide T,C" 
11963 
/*tag= q 

/standard_name= "Single nucleotide 
/note= "Variable nucleotide T,G" 
12750 
/*tag= r 

/standard_name= "Single nucleotide 
/note= "Variable nucleotide G,A" 
12819 
/*tag= s 

/standard_name= "Single nucleotide 
/note= "Variable nucleotide A, T" 

,14634 

V*tag= t 

/standard_namie= "Single nucleotide 



polymorphism" 



polymorphism" 



polymorphism" 



polymorphism" 



polymorphism" 



polymorphism" 



polymorphism" 



polymorphism" 



polymorphism" 



polymorphism" 



polymorphi sm" 
Single nucleotide polymorphism" 



/note= "Variable nucleotide T,C" 
14709 
/*tag= u 
/ s t a nda r d_name= 

/note== "Variable nucleotide C,G" 
15985 
/*tag= V 

/standard_name= "Single nucleotide polymorphism" 
/note= "Variable nucleotide C,A" 
16968 
/*tag= w 

/standard_name= "Single nucleotide polymorphism" 
/note= "Variable nucleotide C,T" 
17124 
/*tag= X 

/standard_name= "Single nucleotide polymorphism" 

/note= "Variable nucleotide C,T" 

17312 
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FT /*tag= y 

FT /standard_name= "Single nucleotide polymorphism" 

FT /note= "Variable nucleotide C,G" 

FT misc_feature 17567 

FT /*tag= z 

FT /standard_name= "Single nucleotide polymorphism" 

FT /note= "Variable nucleotide C,G" 

FT misc_feature 17621 

FT /*tag= aa 

FT /standard_name= "Single nucleotide polymorphism" 

FT /note= "Variable nucleotide C,T" 

FT misc_feature 17775 

FT /*tag= ab 

FT /standard_name= "Single nucleotide polymorphism" 

FT /note= "Variable nucleotide A, G" 

FT misc_feature 18137 

FT /*tag= ac 

FT /standard_name= "Single nucleotide polymorphism" 

FT /note= "Variable nucleotide A, G" 

FT misc_feature 18225 

FT /*tag= ad 

FT /standard_name= "Single nucleotide polymorphism" 

FT /note= "Variable nucleotide A, G" 

FT misc_feature 1834 7 

FT /*tag= ae 

FT /standard_name= "Single nucleotide polymorphism" 

FT /note= "Variable nucleotide C,T" 

FT misc_feature 18879 

FT /*tag= af 

FT /standard_name= "Single nucleotide polymorphism" 

FT /note= "Variable nucleotide A, G" 

FT misc_feature 18923 

FT /*tag= ag 

FT /standard_name= "Single nucleotide polymorphism" 

FT /note= "Variable nucleotide A,C" 

FT misc_feature 19284 

FT /*tag= ah 

FT /standard_name= "Single nucleotide polymorphism" 

FT /note= "Variable nucleotide A,G" 

FT misc_feature 19609 

FT /*tag= ai 

FT /standard_name= "Single nucleotide polymorphism" 

FT /note= "Variable nucleotide C,G" 

FT misc_feature 19714 

FT /*tag= aj 

FT /standard_name= "Single nucleotide polymorphism" 

FT /note= "Variable nucleotide T,C" 

FT misc_feature 19753 

FT /*tag= ak 

FT /standard_name= "Single nucleotide polymorphism" 

FT /note= "Variable nucleotide A, T" 

FT misc_feature 19754 

FT /*tag= al 

FT /standard_name= "Single nucleotide polymorphism" 

FT /note= "Variable nucleotide C,G" 

XX 

PN WO2004042054-A1. 
XX 

PD 21-MAY-2004. 
XX 

PF 22-OCT-2003; 2003WO-IB004 669 . 
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23-OCT-2002; 2002 JP-00308 634 . 

(HUBI-) HUBIT GENOMIX INC. 
(KAMO/) KAMOI K. 

Kamoi K, Suzuki A, Numabe Y, Ji G, Muramatsu M, Baba M; 
WPI; 2004-400678/37. 

Single nucleotide polymorphisms associated with periodontal disease for 
examination and assessment of susceptibility to periodontal disease. 

Claim 9; SEQ ID NO 147/ Opp; Japanese. 

The invention relates to a novel method for examination of periodontal 
disease in which genetic polymorphisms are detected in one or more of 51 
genes. The method is useful for examination, diagnosis and assessment of 
periodontal disisase or risk of periodontal disease and the risk of its 
progression to severe, aggressive and chronic periodontal disease. The 
present sequence represents a polymorphic gene of the invention 

Sequence 20001 BP; 4519 A; 5818 C; 5572 G; 4092 T; 0 U; 0 Other; 

Query Match 63.7%; Score 808.6; DB 13; Length 20001; 

Best Local Similarity 79.6%; Pred. No. 9.6e-178; 

Matches 993; Conservative 0; Mismatches 249; Indels 5; Gaps 3; 



Qy 1 CTCGAGCCATGTGGGTCCCCAGCCGCCGCCACCTCTGTCTGACCTTCCTGCTAGTCTGTG 60 

II I III MM II I I II M I III M M I M M M M M M I 

Db 12792 CTGCAGCCATGTGGCTCCGGAGCCATCGTCAGCTCTGCCTGGCCTTCCTGCTAGTCTGTG 12733 

Qy 61 TTTTAGCAGCAATTTTCTTCCTGAACGTCTATCAAGACCTCTTTTACAGTGGCTTAGACC 120 

I I i I I II I I II I I II I I I II I Ill II I I II I I I 

Db 12732 TCCTCTCTGTAATCTTCTTCCTCCATATCCATCAAGACAGCTTTCCACATGGCCTAGGCC 12673 

Qy 121 TGCTGGCCCTGTGTCCAGACCATAACGTGGTATCATCTCCCGTGGCCATATTCTGCCTGG 180 

II I I I M t I I I II II I I I I I I I II I I I I I I I II I I t I I II I I I I 

Db 12672 TGTCGATCCTGTGTCCAGACCGCCGCCTGGTGACACCCCCAGTGGCCATCTTCTGCCTGC 12613 

Qy 181 CGGGCACGCCGGTACACCCCAACGCCTCCGATTCCTGTCCCAAGCATCCTGCCTCCTTTT 240 

HUM Ml II I I II II I I M I I I I II I II I I II I I I I I I I II I II 
Db 12612 CGGGTACTGCGATGGGCCCCAACGCCTCCTCTTCCTGTCCCCAGCACCCTGCTTCCCTCT 12553 

Qy 241 CCGGGACCTGGACTATTTACCCGGATGGCCGGTTTGGGAACCAGATGGGACAGTATGCCA 300 

I M I II I II I II I I II I I I I II I II I I II I I I I I I II II II I II I I I I I II II 
Db 12552 CCGGCACCTGGACTGTCTACCCCAATGGCCGGTTTGGTAATCAGATGGGACAGTATGCCA 12493 

Qy 301 CGCTGCTGGCCCTGGCGCAGCTCAACGGCCGCCAGGCCTTCATCCAGCCTGCCATGCACG 360 

M II I II I I I I I I I I I I I II I I I II I I I II I II I I II I I I I I I I II I I M I I I I 
Db 12492 CGCTGCTGGCTCTGGCCCAGCTCAACGGCCGCCGGGCCTTTATCCTGCCTGCCATGCATG 12433 

Qy 361 CCGTCCTGGCCCCCGTGTTCCGCATCACGCTGCCTGTCCTGGCGCCCGAGGTAGACAGGC 420 

I M lllllllllll Mill I M II II Mill I 

Db 12432 CCGCCCTGGCCCCGGTATTCCGCATCACCCTGCCCGTGCTGGCCCCAGAAGTGGACAGCC 12373 

Qy 421 ACGCTCCTTGGCGGGAGCTGGAGCTTCACGACTGGATGTCCGAGGATTATGCCCACTTAA 480 

I I II M I Illllllllllll Mill II II MM I 

Db 12372 GCACGCCGTGGCGGGAGCTGCAGCTTCACGACTGGATGTCGGAGGAGTACGCGGACTTGA 12313 

Qy 481 AGGAGCCCTGGCTGAAGCTCACCGGCTTCCCCTGCTCCTGGACCTTCTTCCACCACCTCC 540 



XX 
PR 
XX 
PA 
PA 
XX 
PI 
XX 
DR 
XX 
PT 
PT 
XX 
PS 
XX 
CC 
CC 
CC 
CC 
CC 
CC 
XX 
SQ 
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o^vrivu, ocoiL/ii xvcauita J^ctaiia tui rxyyiiy^aiiuii u:7o-rn-^\Jo oiiu ocaiv^ii xvcauii ua-u3?-o-t-r... x a^c ^vj ox -ru 









J *i ± 


UD 


1 O O R O 


yy 


f;n 1 

DU ± 


DD 


12192 


Qy 




DD 


12ioZ 


Qy 


/ Z 1 


DD 


1 o rt T o 

12072 


Qy 


T O 1 


UD 


12012 


Qy 




Db 


11952 


Qy 


901 


Db 


11892 


Qy 


9 bl 


Dd 


11832 


Qy 


1021 


Dd 


11772 


Qy 


1 no 1 


Db 


11712 


Qy 


11/11 


Db 


11654 


Qy 


1201 


Db 


11595 


RESULT 


10 



Mill I I I M I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I 

GAGATCCTTTCCTGAAGCTCTCTGGCTTCCCCTGCTCTTGGACTTTCTTCCACCATCTCC 12253 
GGGAGCAGATCCGCAGCGAGTTCACCCTGCACGACCACCTTCGGCAAGAGGCCCAGGGGG 600 

I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I Ml I I Ml I I 

GGGAACAGATCCGCAGAGAGTTCACCCTGCACGACCACCTTCGGGAAGAGGCGCAGAGTG 12193 
TACTGAGTCAGTTCCGTCTACCCCGCACAGGGGACCGCCCCAGCACCTTCGTGGGGGTCC 660 

I I I I I II I I I I I I M M I II M M M II I II M I II II II II I I I I II 

TGCTGGGTCAGCTCCGCCTGGGCCGCACAGGGGACCGCCCGCGCACCTTTGTCGGCGTCC 12133 
ACGTGCGCCGCGGGGACTATCTGCGTGTGATGCCCAAGCGCTGGAAGGGGGTGGTGGGTG 720 

I I I I I I M II I II I M I I I I I I I II I I II I M I I I I M I M II I I I M I II I 

ACGTGCGCCGTGGGGACTATCTGCAGGTTATGCCTCAGCGCTGGAAGGGTGTGGTGGGCG 12073 

ACGGCCGTTACCTCCAGCAGGCTATGGACTGGTTCCGGGCCCGATACGAAGCCCCCGTCT 780 
.11 II II I I I I I I I M I I I I I I I I II II M I I I I I II II I I I I I I I I I i I I 
ACAGCGCCTACCTCCGGCAGGCCATGGACTGGTTCCGGGCACGGCACGAAGCCCCCGTTT 12013 

TTGTGGTCACCAGCAACGGCATGGAGTGGTGCCGGAAGAACATCGACACCTCCCGGGGGG 840 

I M I I I I III M I II I I II I I I I I II II I I I I I I I II I I M I I I I M III I 

TCGTGGTCACCAGCAACGGCATGGAGTGGTGTAAAGAAAACATCGACACCTCCCAGGGCG 11953 

ACGTGATCTTTGCTGGCGATGGGCGGGAGGCCGCGCCCGCCAGGGACTTTGCGCTGCTGG 900 
I M I I I I I I I I I I I I I I I I I M I I I I III I I M I I I I I I I I I I 

ATGTGACGTTTGCTGGCGATGGACAGGAGGCTACACCGTGGAAAGACTTTGCCCTGCTCA 11893 

TGCAGTGCAACCACACCATCATGACCATTGGCACCTTCGGCTTCTGGGCCGCCTACCTGG 960 

I I I I I I I I I M I I I I I I I I I I I II M I I I I I I I I I I I I II I II I II II II I I I I I I 
CACAGTGCAACCACACCATTATGACCATTGGCACCTTCGGCTTCTGGGCTGCCTACCTGG 11833 

CTGGTGGAGATACCATCTACTTGGCTAACTTCACCCTGCCCACTTCCAGCTTCCTGAAGA 1020 

I I II I I I I I I I I I I I I M I I I II I I I II I I II I I M II II I I I M I 

CTGGCGGAGACACTGTCTACCTGGCCAACTTCACCCTGCCAGACTCTGAGTTCCTGAAGA 11773 
TCTTTAAACCCGAGGCTGCCTTCCTGCCCGAGTGGGTGGGCATTAATGCAGACTTGTCTC 1080 

I I I I I M M Mill II I I M I I II II I I I I II I I I I II I I I I I I II I II I I I I II I I 

TCTTTAAGCCGGAGGCGGCCTTCCTGCCCGAGTGGGTGGGCATTAATGCAGACTTGTCTC 11713 

CACTCCAGATGTTGGCTGGGCCTTGAACCAGCCAGGAGCCTTTCTGGAATAGCCTCGGTC 1140 
I I I II II I I I I I I I I I I M I I II I M I II III I I I I 

CACTCTGGACATTGGCTAAGCCTTGA—GAGCCAGGGAGACTTTCTGAAGTAGCCTGATC 11655 



I M I I I II I II I I II I II I I I I I I I I I I I II I I Ml II I I 



TCCTGTAGCAG — GCTGGACACTTATTTCAAGAGTGATTCTAATTGG 124 5 
I I I 11 Ml I I II I I I I I I II I II I I M I 

TGTTCCTGAAGCAAATGGGTGCCCGTATCCAGAGTGATTCTAGTTGG 1154 9 



AAQ56908 



ID 
XX 
AC 
XX 
DT 
DT 
XX 



AAQ56908 standard; DNA; 8174 BP. 



AAQ56908; 

25- MAR-2003 

26- JUL-1994 



(revised) 
(first entry) 
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DE DNA encoding a glycosyltransf erase . 
XX 

KW Glycosyltransferase; fucosyltransf erase; GDP-Fuc; in vitro; cell; 

KW surface; oligosaccharide; ss . 

XX 

OS Homo sapiens . 
XX 

FH Key Location/Qualifiers 
FT CDS 4686. .5783 

FT /*tag- a 

XX 

PN WO9402616-A1. 
XX 

PD 03-FEB-1994. 
XX 

PF 20-JUL-1993; 93WO-US006703 . 
XX 

PR 20-JUL-1992; 92US-00914281 . 
XX 

PA (UNMI ) UNIV MICHIGAN. 
XX 

PI Lowe JB; 
XX 

DR WPI; 1994-048874/06. 
DR P-PSDB; AAR45936. 
XX 

PT DNA fragment encoding a glycosyltransferase - can be used for in vitro 
PT reactions to modify cell surface oligosaccharide ( s ) e.g. blood gp. 
PT determinants, to protect against transplant rejection. 
XX 

PS Disclosure; Fig 3; 249pp; English. 
XX 

CC The sequence is that encoding hioman glycosyl transferase. The enzyme 

CC produced by the DNA may be non glycosylated. This prevents premature loss 

CC of enzyme activity. It can also be used in in vitro reactions to modify 

CC cell surface oligosaccharide mols . e.g. blood group determinants. See 

CC also AAQ56905-12. (Updated on 25-MAR-2003 to correct PN field ) 

XX 

SQ Sequence 8174 BP; 1628 A; 2228 C; 2322 G; 1996 T; 0 U; 0 Other; 



Query Match 63.6%; Score 807; DB 2; Length 8174; 

Best Local Similarity 79.6%; Pred. No. 1.8e-177; 

Matches 992; Conservative 0; Mismatches 250; Indels 5; Gaps 3; 

CTCGAGCCATGTGGGTCCCCAGCCGCCGCCACCTCTGTCTGACCTTCCTGCTAGTCTGTG 60 
II I I I I I M I I I III I I II II II MM! Ill I I I I II I I II I I I I I I I I 



I I I I I I I I I I I I I I I I II I I M M I I I I II II II I I I I I 



II I Milllllllllll I MM II I II llllllll IMIIIIII 

TGTCGATCCTGTGTCCAGACCGCCGCCTGGTGACACCCCCAGTGGCCATCTTCTGCCTGC 4857 



I I I I I I III M II I II M M II II II II II I I II II II M I I II I I 
:gggtactgcgatgggccccaacgcctcctcttcctgtccccagcaccctgcttccctct 491- 

241 ccgggacctggactatttacccggatggccggtttgggaaccagatgggacagtatgcca 300 



Qy 


1 


Db 


4678 


Qy 


61 


Db 


4738 


Qy 


121 


Db 


4798 


Qy 


181 


Db 


4858 


Qy 


241 
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I I I I I I M I I I I I I I I I II I I II I I II I I I I I I 

CCGGCACCTGGACTGTCTACCCCAATGGCCGGTTTGGTAATCAGATGGGACAGTATGCCA 4977 

CGCTGCTGGCCCTGGCGCAGCTCAACGGCCGCCAGGCCTTCATCCAGCCTGCCATGCACG 360 

I I I I I M I I I I M I I M I I I I I I I I I I I I II I II I I I MM I I I M M M I I I I 

CGCTGCTGGCTCTGGCCCAGCTCAACGGCCGCCGGGCCTTTATCCTGCCTGCCATGCATG 5037 
CCGTCCTGGCCCCCGTGTTCCGCATCACGCTGCCTGTCCTGGCGCCCGAGGTAGACAGGC 420 

IN I I I I II M I M II I It I II I I I Mill II II I II II II II Mill I 

CCGCCCTGGCCCCGGTATTCCGCATCACCCTGCCCGTGCTGGCCCCAGAAGTGGACAGCC 5097 
ACGCTCCTTGGCGGGAGCTGGAGCTTCACGACTGGATGTCCGAGGATTATGCCCACTTAA 480 

I I II M M M II M II II I M I II II M I II I M I II I I I M II MM I 

GCACGCCGTGGCGGGAGCTGCAGCTTCACGACTGGATGTCGGAGGAGTACGCGGACTTGA 5157 

AGGAGCCCTGGCTGAAGCTCACCGGCTTCCCCTGCTCCTGGACCTTCTTCCACCACCTCC 540 

II II I II II II II I I M I II I I II I I M I Mill II II I I M II I II II 
GAGATCCTTTCCTGAAGCTCTCTGGCTTCCCCTGCTCTTGGACTTTCTTCCACCATCTCC 5217 

GGGAGCAGATCCGCAGCGAGTTCACCCTGCACGACCACCTTCGGCAAGAGGCCCAGGGGG 600 
INI I 11 II I I M I I II II II I I I II I II I II I I II M II M I II I II I III I I 
GGGAACAGATCCGCAGAGAGTTCACCCTGCACGACCACCTTCGGGAAGAGGCGCAGAGTG 5277 

TACTGAGTCAGTTCCGTCTACCCCGCACAGGGGACCGCCCCAGCACCTTCGTGGGGGTCC 660 
I M I II I II II M II II II I I I I I II II I II t I II II II I I I II II I I 
TGCTGGGTCAGCTCCGCCTGGGCCGCACAGGGGACCGCCCGCGCACCTTTGTCGGCGTCC 5337 

ACGTGCGCCGCGGGGACTATCTGCGTGTGATGCCCAAGCGCTGGAAGGGGGTGGTGGGTG 720 

I I 11 t I II II M I II I II II I I 

ACGTGCGCCGTGGGGACTATCTGCAGGTTATGCCTCAGCGCTGGAAGGGTGTGGTGGGCG 5397 

ACGGCCGTTACCTCCAGCAGGCTATGGACTGGTTCCGGGCCCGATACGAAGCCCCCGTCT 780 

II M II M II I I II II I II I I M I II II II II II II II I II II I I II II I 

ACAGCGCCTACCTCCGGCAGGCCATGGACTGGTTCCGGGCACGGCACGAAGCCCCCGTTT 5457 

TTGTGGTCACCAGCAACGGCATGGAGTGGTGCCGGAAGAACATCGACACCTCCCGGGGGG 840 
I I I I I II II II II I II II II II II I II I II t I I II II II I M II I II III I 

TCGTGGTCACCAGCAACGGCATGGAGTGGTGTAAAGAAAACATCGACACCTCCCAGGGCG 5517 

ACGTGATCTTTGCTGGCGATGGGCGGGAGGCCGCGCCCGCCAGGGACTTTGCGCTGCTGG 900 
I I M I II II M I II II II I I II II M III I I II I II II I II II 

ATGTGACGTTTGCTGGCGATGGACAGGAGGCTACACCGTGGAAAGACTTTGCCCTGCTCA 5577 

TGCAGTGCAACCACACCATCATGACCATTGGCACCTTCGGCTTCTGGGCCGCCTACCTGG 960 

I M I I I I II II II II II II II II M i I M II II M II II II II M t II II It II II 
CACAGTGCAACCACACCATTATGACCATTGGCACCTTCGGCTTCTGGGCTGCCTACCTGG 5637 

CTGGTGGAGATACCATCTACTTGGCTAACTTCACCCTGCCCACTTCCAGCTTCCTGAAGA 1020 
M I I I M M 11 II I It 1 It I II I II II II II 1 I II II It II It II 

CTGGCGGAGACACTGTCTACCTGGCCAACTTCACCCTGGCAGACTCTGAGTTCCTGAAGA 5697 

TCTTTAAACCCGAGGCTGCCTTCCTGCCCGAGTGGGTGGGCATTAATGCAGACTTGTCTC 1080 

I M M II II II II I M M II II t II II II II 11 II II II 1 M M II II II II It II I 

TCTTTAAGCCGGAGGCGGCCTTCCTGCCCGAGTGGGTGGGCATTAATGCAGACTTGTCTC 5757 

CACTCCAGATGTTGGCTGGGCCTTGAACCAGCCAGGAGCCTTTCTGGAATAGCCTCGGTC 1140 
I I I II It It II II II II II I I II II II II III I III 

CAC TCTGGACATTGGCTAAGCCTTGA- - GAGCC AGGGAGAC TTTC TGAAGTAGCCTGATC 5815 

AACCCAGGGCCAGCGTTATGGGTCTCCGGAAGCCCGAGTAACTTCCGGAGATGCTGGTGG 1200 
ill M I I I I 1 II It I I I t II 1 I II I II II I I II I 



Db 


4918 


Qy 


301 


Db 


4978 


Qy 


361 


Db 


5038 


Qy 


421 


Db 


5098 


Qy 


481 


Db 


5158 


Ov 


541 


Db 


5218 


Ov 


601 


Db 


5278 


Ov 


661 


Db 


5338 


Ov 


721 


Db 


5398 


Qv 


781 


Db 


5458 


Qv 


841 


Db 


5518 


Qv 


901 


Db 


5578 


Qv 


961 


Db 


5638 


Qy 


1021 


Db 


5698 


Qy 


1081 


Db 


5758 


Qy 


1141 
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Db 5816 TTTCTAGAGCCAGCAGTACGTGGCTTCAGAGGCCTG-GCATCTTCTGGAGAAGCTTGTGG 5874 

Qy 1201 TCCTGTAGCAG— GCTGGACACTTATTTCAAGAGTGATTCTAATTGG 124 5 

I I III III I I II .11 I I I I I I I I M I I I I 

Db 5875 TGTTCCTGAAGCAAATGGGTGCCCGTATCCAGAGTGATTCTAGTTGG 5921 



AAT7 67 68; 



15-SEP-1997 (first entry) 
Human alpha 1,2 fucosyltransf erase cDNA. 

Human GDP-L-fucose :beta-D-galactoside-2-alpha-L- fucosyltransf eraser- 
alpha 1,2 FT; transgene; transgenic mouse; animal model; 
intestinal adhesion; Helicobacter pylori infection; stomach; 
small intestine; gut; epithelial cell; surface receptor; 
carbohydrate antigen; gastritis; peptic ulcer; neoplasia; 
gastric adenocarcinoma; Lewis antigen; fucosylation; ds . 



RESULT 11 
AAT76768 

ID AAT7 6768 standard; 
XX 
AC 
XX 
DT 
XX 
DE 
XX 
KW 

m 

KW 
KW 
KW 
KW 
XX 
OS 
XX 
FH 
FT 
FT 
FT 
XX 
PN 
XX 
PD 
XX 
PF 
XX 
PR 
XX 
PA 
XX 
PI 
XX 
DR 
DR 
XX 
PT 
PT 
XX 
PS 
XX 
CC 
CC 
CC 
CC 
CC 
CC 
CC 
CC 
CC 
CC 



cDNA; 3373 BP. 



Homo sapiens . 

Key 
CDS 



US5625124-A. 
29-APR-1997. 
ll-JUL-1994; 
ll-JUL-1994; 



Location/Qualifiers 
104. .1201 
/*tag= a 

/EC nvimber= "2.4.1.69" 



94US-00273411. 



94US-00273411. 



(UNIW ) UNIV WASHINGTON. 

Falk P, Gordon JI; 

WPI; 1997-258275/23. 
P-PSDB; AAW23805. 



Animal model for Helicobacter pylori infection - comprising transgenic 
mouse expressing human enzyme promoting intestinal adhesion. 

Example 1; Col 17-20; 24pp; English. 

A claimed transgenic mouse expresses, in its intestinal epithelial cells, 
the enzyme human GDP-L-fucose :beta-D-galactoside 2-alpha-L- 
fucosyltransferase (also called alpha 1,2 FT) or human GDP-L-fucose: beta 
-D-N-acetylglucosamide 3/4-alpha-L-fucosyltransf erase (also called alpha 
1,3/4 FT). The enzyme is expressed under the control of a gut epithelial 
cell-specific promoter and Helicobacter pylori adheres to the transgenic 
cells. The transgenic mouse and intestinal epithelial cells from it are 
useful as models for screening compounds for the ability to inhibit 
adhesion of H. pylori to gut epithelial cells. The present sequence 
encodes human alpha 1,2 FT and was published by Larsen et al.. 
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ov^vyxNjj, ocais^ii Avcauito xycLOiia lui r^yyii^au^Jii \j:7o^^^kjo aiiu ocai^ii avcauil ua-ui^-on^.. . x a^c jw 



CC Proc. Natl. Acad. Sci. USA, 87: 6674-6678 (1990) 
XX 

SQ Sequence 3373 BP; 687 A; 925 C; 905 G; 856 T; 0 U; 0 Other; 

Query Match 63.6%; Score 806.8; DB 2; Length 3373; 

Best Local Similarity 79.7%; Pred. No. 1.6e-177; 

Matches 990; Conservative 0; Mismatches 247; Indels 5; Gaps 3; 
Qy 6 GCCATGTGGGTCCCCAGCCGCCGCCACCTCTGTCTGACCTTCCTGCTAGTCTGTGTTTTA 65 

I I M I I I I I I I I MM I I M M M I M I M M M M M M I M M M I 

Db 101 GCCATGTGGCTCCGGAGCCATCGTCAGCTCTGCCTGGCCTTCCTGCTAGTCTGTGTCCTC 160 

Qy 66 GCAGCAATTTTCTTCCTGAACGTCTATCAAGACCTCTTTTACAGTGGCTTAGACCTGCTG 125 

I I M I M M M M I M M M M n MM M M M I M M I 

Db 161 TCTGTAATCTTCTTCCTCCATATCCATCAAGACAGCTTTCCACATGGCCTAGGCCTGTCG 220 

Qy 126 GCCCTGTGTCCAGACCATAACGTGGTATCATCTCCCGTGGCCATATTCTGCCTGGCGGGC 185 

I I I I M M M M M I M M M I M M M I M I M M M M I I M I 

Db 221 ATCCTGTGTCCAGACCGCCGCCTGGTGACACCCCCAGTGGCCATCTTCTGCCTGCCGGGT 280 

Qy 186 ACGCCGGTACACCCCAACGCCTCCGATTCCTGTCCCAAGCATCCTGCCTCCTTTTCCGGG 245 

i I III II I M M II II II M I II II I II M II M I II M I I II M I 
Db 281 ACTGCGATGGGCCCCAACGCCTCCTCTTCCTGTCCCCAGCACCCTGCTTCCCTCTCCGGC 340 

Qy 24 6 ACCTGGACTATTTACCCGGATGGCCGGTTTGGGAACCAGATGGGACAGTATGCCACGCTG 305 

I I M II I II I II II I II II II I I I M II II II I I II I I II II II II II II II I I 

Db 341 ACCTGGACTGTCTACCCCAATGGCCGGTTTGGTAATCAGATGGGACAGTATGCCACGCTG 400 

Qy 306 CTGGCCCTGGCGCAGCTCAACGGCCGCCAGGCCTTCATCCAGCCTGCCATGCACGCCGTC 365 

I I I I I Mill II II II II II II II I I II I I II II II II II II I II II I MM I 
Db 401 CTGGCTCTGGCCCAGCTCAACGGCCGCCGGGCCTTTATCCTGCCTGCCATGCATGCCGCC 4 60 

Qy 366 CTGGCCCCCGTGTTCCGCATCACGCTGCCTGTCCTGGCGCCCGAGGTAGACAGGCACGCT 425 

I I II I I II II II II II II II I MM! II II II I II II II II I II I I I 

Db 461 CTGGCCCCGGTATTCCGCATCACCCTGCCCGTGCTGGCCCCAGAAGTGGACAGCCGCACG 520 

Qy 426 CCTTGGCGGGAGCTGGAGCTTCACGACTGGATGTCCGAGGATTATGCCCACTTAAAGGAG 485 

M II II II II II II II II M I II II II II I II I Mill II II II II I II 
Db 521 CCGTGGCGGGAGCTGCAGCTTCACGACTGGATGTCGGAGGAGTACGCGGACTTGAGAGAT 580 

Qy 48 6 CCCTGGCTGAAGCTCACCGGCTTCCCCTGCTCCTGGACCTTCTTCCACCACCTCCGGGAG 545 

II I I I I II I M I I II II II I II II II I II II I II I ri I II I I I I II I II I I 

Db 581 CCTTTCCTGAAGCTCTCTGGCTTCCCCTGCTCTTGGACTTTCTTCCACCATCTCCGGGAA 640 

Qy 54 6 CAGATCCGCAGCGAGTTCACCCTGCACGACCACCTTCGGCAAGAGGCCCAGGGGGTACTG 605 

M II II II II I II II I II II II I II I II II II II I II I II I II II III I II III 
Db 641 CAGATCCGCAGAGAGTTCACCCTGCACGACCACCTTCGGGAAGAGGCGCAGAGTGTGCTG 700 

Qy 606 AGTCAGTTCCGTCTACCCCGCACAGGGGACCGCCCCAGCACCTTCGTGGGGGTCCACGTG 665 

lllll INI II MMMMIIMIMIM lllllll II II lllllllll 

Db 701 GGTCAGCTCCGCCTGGGCCGCACAGGGGACCGCCCGCGCACCTTTGTCGGCGTCCACGTG 760 

Qy 666 CGCCGCGGGGACTATCTGCGTGTGATGCCCAAGCGCTGGAAGGGGGTGGTGGGTGACGGC 725 

11 I II II I II I Ml III II 

Db 761 CGCCGTGGGGACTATCTGCAGGTTATGCCTCAGCGCTGGAAGGGTGTGGTGGGCGACAGC 820 

Qy 726 CGTTACCTCCAGCAGGCTATGGACTGGTTCCGGGCCCGATACGAAGCCCCCGTCTTTGTG 785 

lllllll I M II I I II II II I I II II I II I II II II II M II II I II III 
Db 821 GCCTACCTCCGGCAGGCCATGGACTGGTTCCGGGCACGGCACGAAGCCCCCGTTTTCGTG 880 

Qy 786 GTCACCAGCAACGGCATGGAGTGGTGCCGGAAGAACATCGACACCTCCCGGGGGGACGTG 845 
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Db 


881 


INI 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 M 1 III II III 

GTCACCAGCAACGGCATGGAGTGGTGTAAAGAAAACATCGACACCTCCCAGGGCGATGTG 


940 


Qy 


846 


ATCTTTGCTGGCGATGGGCGGGAGGCCGCGCCCGCCAGGGACTTTGCGCTGCTGGTGCAG 

1 MIIMIIIIIIM 1 MUM III 1 MIIMM Mill III 

ACGTTTGCTGGCGATGGACAGGAGGCTACACCGTGGAAAGACTTTGCCCTGCTCACACAG 


905 


Db 


941 


1000 


Qy 


906 


TGCAACCACACCATCATGACCATTGGCACCTTCGGCTTCTGGGCCGCCTACCTGGCTGGT 

IIIIIIMI IIIIIMIIIIIIIII MIIMM IIIMIIIIIIIII 

TGCAACCACACCATTATGACCATTGGCACCTTCGGCTTCTGGGCTGCCTACCTGGCTGGC 


965 


Db 


1001 


1060 


Qy 


966 


GGAGATACCATCTACTTGGCTAACTTCACCCTGCCCACTTCCAGCTTCCTGAAGATCTTT 

1 1 1 1 1 1 1 1 1 1 II M 1 1 1 II 1 1 1 1 1 M II II II 1 II II M M 1 1 1 II 1 

GGAGACACTGTCTACCTGGCCAACTTCACCCTGCCAGACTCTGAGTTCCTGAAGATCTTT 


1025 


Db 


1061 


1120 


Qy 


1026 


AAACCCGAGGCTGCCTTCCTGCCCGAGTGGGTGGGCATTAATGCAGACTTGTCTCCACTC 

II II Hill II M 1 M 1 1 1 M t 1 II II 1 1 II II II M 1 II 1 1 1 II II M 1 II 1 j M 1 

AAGCCGGAGGCGGCCTTCCTGCCCGAGTGGGTGGGCATTAATGCAGACTTGTCTCCACTC 


1085 


Db 


1121 


1180 


Qy 


1086 


CAGATGTTGGCTGGGCCTTGAACCAGCCAGGAGCCTTTCTGGAATAGCCTCGGTCAACCC 

II M M M 1 II M M 1 1 M 1 1 1 II Ml 1 III | 

TGGACATTGGCTAAGCCTTGA— GAGCCAGGGAGACTTTCTGAAGTAGCCTGATCTTTCT 


1145 


Db 


1181 


1238 


Qy 


1146 


AGGGCCAGCGTTATGGGTCTCCGGAAGCCCGAGTAACTTCCGGAGATGCTGGTGGTCCTG 
II mill N 1 1 1 1 1 1 1 M 1 1 1 1 1 1 II 1 II 1 1 II 1 1 II II 1 
AGAGCCAGCAGTACGTGGCTTCAGAGGCCTG-GCATCTTCTGGAGAAGCTTGTGGTGTTC 


1205 


Db 


1239 


1297 


Qy 


1206 


TAGCAG— GCTGGACACTTATTTCAAGAGTGATTCTAATTGG 124 5 

III 1 1 1 1 1 M 1 II M 1 1 M II 1 11 1 1 
CTGAAGCAAATGGGTGCCCGTATCCAGAGTGATTCTAGTTGG 1339 




Db 


1298 





RESULT 12 
AAD46825 

ID AAD46825 standard; cDNA; 3373 BP. 
XX 

AC AAD4 6825; 
XX 

DT 27-JAN-2003 (first entry) 
XX 

DE Human alpha ( 1, 2 ) fucosyltransf erase (alphal,2FT) cDNA 
XX 



KW 
KW 



Carbohydrate antigen; alpha ( 1, 3) galactosyltransf erase; alphal,3GT; 
transgenic; alpha (1, 2) fucosyltransf erase; alphal,2FT; human; enzyme; 
KW gene; ss . 
XX 

OS Homo sapiens. 
XX 

FH Key Location/Qualifiers 
H^T CDS 104. .1201 

FT /*tag= a 

/product^ "Human alphal,2FT protein" 

XX 

PN WO200274948-A2. 
XX 

PD 26-SEP-2002. 
XX 

PF 21-MAR-2002; 2002WO-CA000378 . 
XX 

PR 21-MAR-2001; 2001US-0277811P . 
XX 
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Mammalian cells, useful for producing animal tissues with carbohydrate 
antigens that are compatible for transplantation into human patients. 



PA (GERO-) GERON CORP. 
XX 

PI Denning C, Clark AJ, Schif f JM; 
XX 

DR WPI; 2002-759895/82. 

DR P-PSDB; AAE29226. 
XX 
PT 
PT 
XX 

PS Disclosure; Page 35-36; 71pp; English. 
XX 

CC The invention relates to animal tissues with carbohydrate antigens that 

CC are compatible for transplantation into human patients. The mammalian 

CC cell is inactivated homo zygously* for expression of alpha ( 1, 3) galactosyl- 

CC transferase (alphal, 3GT) gene and comprises a transgene for alpha (1,2)- 

CC fucosyltransferase (alphal, 2FT) . It is useful for producing animal tissue 

CC with carbohydrate antigens that are compatible for transplantation into 

CC human patients. The present sequence is human alphal, 2FT cDNA 
XX 

SQ Sequence 3373 BP; 687 A; 925 C; 905 G; 856 T; 0 U; 0 Other; 

Query Match 63.6%; Score 806.8; DB 6; Length 3373; 

Best Local Similarity 79.7%; Pred. No. 1.6e-177; 

Matches 990; Conservative 0; Mismatches 247; Indels 5; Gaps 3; 
Qy 6 GCCATGTGGGTCCCCAGCCGCCGCCACCTCTGTCTGACCTTCCTGCTAGTCTGTGTTTTA 65 

lllllllll III MM M M M I M M I M I M M M M M M M M I 

Db 101 GCCATGTGGCTCCGGAGCCATCGTCAGCTCTGCCTGGCCTTCCTGCTAGTCTGTGTCCTC 160 

Qy 66 GCAGCAATTTTCTTCCTGAACGTCTATCAAGACCTCTTTTACAGTGGCTTAGACCTGCTG 125 

I I t M M I M M I I M M M M M MM M M M I M M 1 

Db 161 TCTGTAATCTTCTTCCTCCATATCCATCAAGACAGCTTTCCACATGGCCTAGGCCTGTCG 220 

Qy 126 GCCCTGTGTCCAGACCATAACGTGGTATCATCTCCCGTGGCCATATTCTGCCTGGCGGGC 185 

III i lllllllll I M I 11 I II I I 

Db 221 ATCCTGTGTCCAGACCGCCGCCTGGTGACACCCCCAGTGGCCATCTTCTGCCTGCCGGGT 280 

Qy 186 ACGCCGGTACACCCCAACGCCTCCGATTCCTGTCCCAAGCATCCTGCCTCCTTTTCCGGG 245 

N Ml II I II I M II I II I M I II I I I I I I I I I II II II I I M I I I 

Db 281 ACTGCGATGGGCCCCAACGCCTCCTCTTCCTGTCCCCAGCACCCTGCTTCCCTCTCCGGC 340 

Qy 246 ACCTGGACTATTTACCCGGATGGCCGGTTTGGGAACCAGATGGGACAGTATGCCACGCTG 305 

INIIIIII I Mill Mi I MIIIIIMIMIMIIIIIIII 

Db 341 ACCTGGACTGTCTACCCCAATGGCCGGTTTGGTAATCAGATGGGACAGTATGCCACGCTG 400 

Qy 306 CTGGCCCTGGCGCAGCTCAACGGCCGCCAGGCCTTCATCCAGCCTGCCATGCACGCCGTC 365 

I I I M Mill I M I I I M I I II I M I MUM MM II II II M I I I I MM I 

Db 401 CTGGCTCTGGCCCAGCTCAACGGCCGCCGGGCCTTTATCCTGCCTGCCATGCATGCCGCC 4 60 

Qy 366 CTGGCCCCCGTGTTCCGCATCACGCTGCCTGTCCTGGCGCCCGAGGTAGACAGGCACGCT 425 

nil I IMMIIMM IIIM M MM I Mill I I I 

Db 461 CTGGCCCCGGTATTCCGCATCACCCTGCCCGTGCTGGCCCCAGAAGTGGACAGCCGCACG 520 

Qy 426 CCTTGGCGGGAGCTGGAGCTTCACGACTGGATGTCCGAGGATTATGCCCACTTAAAGGAG 485 

II Mill Ml I I II II Mill M 

Db 521 CCGTGGCGGGAGCTGCAGCTTCACGACTGGATGTCGGAGGAGTACGCGGACTTGAGAGAT 580 

Qy 486 CCCTGGCTGAAGCTCACCGGCTTCCCCTGCTCCTGGACCTTCTTCCACCACCTCCGGGAG 545 

M I MMMMI I llllllllllllll Mill lllllllllll llllllll 

Db 581 CCTTTCCTGAAGCTCTCTGGCTTCCCCTGCTCTTGGACTTTCTTCCACCATCTCCGGGAA 640 
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o^wivl:. ocaiv.li xvcauiia i^ctoiia lui ^ppuvatiuii v/^o^^^uo oiiu ocan.ii xvcauii ua-v^-o-^n.. / rage v^i 



Qy 


546 


CAGATCCGCAGCGAGTTCACCCTGCACGACCACCTTCGGCAAGAGGCCCAGGGGGTACTG 
1 1 1 i 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M II III 

CAGATCCGCAGAGAGTTCACCCTGCACGACCACCTTCGGGAAGAGGCGCAGAGTGTGCTG 


DUO 


Db 


641 


inn 


Qy 


606 


AGTCAGTTCCGTCTACCCCGCACAGGGGACCGCCCCAGCACCTTCGTGGGGGTCCACGTG 

1 1 M 1 1 1 M 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

GGTCAGCTCCGCCTGGGCCGCACAGGGGACCGCCCGCGCACCTTTGTCGGCGTCCACGTG 


U U J 


Db 


701 


/ ou 


Qy 


666 


CGCCGCGGGGACTATCTGCGTGTGATGCCCAAGCGCTGGAAGGGGGTGGTGGGTGACGGC 

Hill 1 1 II 1 1 1 1 1 1 1 1 1 M Mill 1 1 II 1 II II II II Ill III II 

CGCCGTGGGGACTATCTGCAGGTTATGCCTCAGCGCTGGAAGGGTGTGGTGGGrf^ArAar 




Db 


761 


ft c> n 


Qy 


726 


CGTTACCTCCAGCAGGCTATGGACTGGTTCCGGGCCCGATACGAAGCCCCCGTCTTTGTG 
N 1 1 i 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 t 1 1 1 I 1 III 
GCCTACCTCCGGCAGGCCATGGACTGGTTCCGGGCACGGCACGAAGCCCCCGTTTTCGTG 


7ft R 


Db 


821 


ft ft n 

O o u 


Qy 


786 


GTCACCAGCAACGGCATGGAGTGGTGCCGGAAGAACATCGACACCTCCCGGGGGGACGTG 
1 < 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 III II III 
GTCACCAGCAACGGCATGGAGTGGTGTAAAGAAAACATCGACACCTCCCAGGGrr;ATr;Tr 


ft J 


Db 


881 


jH\J 


Qy 


846 


ATCTTTGCTGGCGATGGGCGGGAGGCCGCGCCCGCCAGGGACTTTGCGCTGCTGGTGCAG 

1 MIIIIIIIIIIM 1 mill III 1 llllllll Mill III 

ACGTTTGCTGGCGATGGACAGGAGGCTACACCGTGGAAAGACTTTGCCCTGrTPArArAr 




Db 


941 


lUUU 


Qy 


906 


TGCAACCACACCATCATGACCATTGGCACCTTCGGCTTCTGGGCCGCCTACCTGGCTGGT 

1 MMMI IMMIMMMMMMIMIMIMM Mill 

TGC AACC AC AC CAT T ATGAC CAT T GGC AC C T T C GGC T TC T Gaac TarrTurrn^rrrn^rnr 


OCR 

y bo 


Db 


1001 


lObO 


Qy 


966 


GGAGATACCATCTACTTGGCTAACTTCACCCTGCCCACTTCCAGCTTCCTGAAGATCTTT 
1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 II 1 1 1 II 1 1 1 II 1 II 1 1 t 1 1 1 1 1 1 1 II 
GGAGACACTGTCTACCTGGCCAACTTCACCCTGCCAGACTCTGAGTTCCTGAAGATrTTT 


1025 


Db 


1061 


lion 


Qy 

Db 


1026 
1121 


AAACCCGAGGCTGCCTTCCTGCCCGAGTGGGTGGGCATTAATGCAGACTTGTCTCCACTC 

It II Mill MIMMMMMMMMMMMMMIMMMMMIMIMM 

AAGCCGGAGGCGGCCTTCCTGCCCGAGTGGGTGGGCATTAATGCAGACTTGTrrrrArTr 


1 nft t; 

XUO 0 

lion 


Qy 


1086 


CAGATGTTGGCTGGGCCTTGAACCAGCCAGGAGCCTTTCTGGAATAGCCTCGGTCAACCC 

1 i 1 1 1 M 1 1 1 1 1 II 1 II II III 1 III 1 

TGGACATTGGCTAAGCCTTGA — GAGCCAGGGAGACTTTCTGAAGTAGCCTGATCTTTCT 


il4 0 


Db 


1181 


1 OTft 


Qy 


1146. 


AGGGCCAGCGTTATGGGTCTCCGGAAGCCCGAGTAACTTCCGGAGATGCTGGTGGTCCTG 

1 II N 1 1 II 1 II III 1 1 1 MM Mill 1 

AGAGCCAGCAGTACGTGGCTTCAGAGGCCTG-GCATCTTCTGGAGAAGCTTGTGGTGTTC 




uD 




1297 


Qy 


1206 


TAGCAG— GCTGGACACTTATTTCAAGAGTGATTCTAATTGG 124 5 

i II Ml 1 1 II II 1 It II II II 1 II II 
CTGAAGCAAATGGGTGCCCGTATCCAGAGTGATTCTAGTTGG 1339 




Db 


1298 





RESULT 13 
AAT12238 

ID AAT12238 standard; cDNA to mRNA; 1174 BP. 
XX 

AC AAT12238; 
XX 

DT 08-APR-1996 (first entry) 
XX 

DE Human H-transf erase cDNA. 
XX 

KW H-transferase; xenograft hyperacute rejection; transplantation; 
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ov^v^xvij, ocaiuii Avcauita x^cioiia lui rvppin.auuii Vv^o^^^uo miu ocai*.ii xvcauit ua-v^-o^-t.. . i a^c -ru 

KW glycosyltransferase; galactose alpha (1,3) galactose; ds 
XX 

OS Homo sapiens . 
XX 

FH Key Location/Qualifiers 
FT CDS 15. .1112 

FT /*tag= a 

XX 

PN WO9534202-A1. 
XX 

PD 21-DEC-1995. 
XX 

PF 14-JUN-1995; 95WO-US007554 . 
XX 

PR . 15-JUN-1994; 94US-00260201 . 
PR 21-JUL-1994; 94US-00278282 . 
XX 

PA (ALEX-) ALEXION PHARM INC. 
PA ■ (AUST-) AUSTIN RES INST. 
XX 

PI Sandrin MS, Fodor WL, Rother RP, Squinto SP, Mckenzie IFC; 
XX 

DR WPI; 1996-049326/05. 
DR P-PSDB; AAR90572. 
XX . 
PT 
PT 
XX 
PS 
XX 



Redn. of rejection of xenogeneic cells following transplantation - by 
introducing a vector expressing fucosyl : transferase into the cells. 

Example 1; Page 45-47; 69pp; English. 



CC A cDNA clone {AAT12238) encoding human H-transf erase (AAR90572) was obtd. 

CC from cDNA prepd. from human epidermoid carcinoma cells (ATCC CRL 155 A- 

CC 431) by PGR using primers (AAT12240-41) based on H- transferase cDNA 5» 

CC and 3» untranslated regions. The cDNA can be incorporated into vector 

CC APEX-1 (AAT12239) for expression in xenogeneic organs, tissues and cells. 

CC This results in decreased expression of the non-human antigen galactose 

CC alpha (1,3) galactose on the surface of the organs etc. so that hyperacute 

CC rejection is reduced upon transplantation to humans 
XX 

SQ Sequence 1174 BP; 202 A; 388 C; 328 G; 256 T; 0 U; 0 Other; 

Query Match 62.6%; Score 7 94.2; DB 2; Length 1174; 

Best Local Similarity 80.7%; Pred. No. l.le-174; 

Matches 940; Conservative 0; Mismatches 223; Indels 2; Gaps 1; 



Qy 


6 


GCCATGTGGGTCCCCAGCCGCCGCCACCTCTGTCTGACCTTCCTGCTAGTCTGTGTTTTA 

1 1 1 1 1 1 t 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 t 1 

GCCATGTGGCTCCGGAGCCATCGTCAGCTCTGCCTGGCCTTCCTGCTAGTCTGTGTCCTC 


65 


Db 


12 


71 


Qy 


66 


GCAGCAATTTTCTTCCTGAACGTCTATCAAGACCTCTTTTACAGTGGCTTAGACCTGCTG 

Mil 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

TCTGTAATCTTCTTCCTCCATATCCATCAAGACAGCTTTCCACATGGCCTAGGCCTGTCG 


125 


Db 


72 


131 


Qy 


126 


GCCCTGTGTCCAGACCATAACGTGGTATCATCTCCCGTGGCCATATTCTGCCTGGCGGGC 

M 1 Ill t 1 II 1 1 1 1 II II 1 II II 

ATCCTGTGTCCAGACCGCCGCCTGGTGACACCCCCAGTGGCCATCTTCTGCCTGCCGGGT 


185 


Db 


132 


191 


Qy 


186 


ACGCCGGTACACCCCAACGCCTCCGATTCCTGTCCCAAGCATCCTGCCTCCTTTTCCGGG 

1 II 1 1 II 1 II II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 M 1 1 

ACTGCGATGGGCCCCAACGCCTCCTCTTCCTGTCCCCAGCACCCTGCTTCCCTCTCCGGC 


245 


Db 


192 


251 
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ov^v^xvLi ocoiou Avcauita x^ctmia ivji /^^pin^auuu u;7o-t-tz.uo oiiu ocaiuu xvcauii u»-u:7-on-r... x a^c j-j vji -tu 



Qy 


246 


ACCTGGACTATTTACCCGGATGGCCGGTTTGGGAACCAGATGGGACAGTATGCCACGCTG 
1 > i 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 II 1 1 1 1 II 1 1 1 1 II 1 1 I 1 1 1 1 1 1 1 1 1 
ACCTGGACTGTCTACCCCAATGGCCGGTTTGGTAATCAGATGGGACAGTATGCCACGCTG 


305 


Db 


252 


311 


Qy 


306 


CTGGCCCTGGCGCAGCTCAACGGCCGCCAGGCCTTCATCCAGCCTGCCATGCACGCCGTC 

KIM I 1 1 1 1 1 1 1 1 1 1 1 II II Mil 1 1 1 1 II II II t 

CTGGCTCTGGCCCAGCTCAACGGCCGCCGGGCCTTTATCCTGCCTGCCATGCATGCCGCC 


365 


Db 


312 


371 


Qy 


366 


CTGGCCCCCGTGTTCCGCATCACGCTGCCTGTCCTGGCGCCCGAGGTAGACAGGCACGCT 
1 1 1 1 1 1 1 i II 1 1 1 1 1 1 1 1 1 1 1 Mill II Mill II II II 1 1 1 1 1 1 1 1 
CTGGCCCCGGTATTCCGCATCACCCTGCCCGTGCTGGCCCCAGAAGTGGACAGCCGCACG 


425 


Db 


372 


431 


Qy 


426 


CCTTGGCGGGAGCTGGAGCTTCACGACTGGATGTCCGAGGATTATGCCCACTTAAAGGAG 

II 1 1 1 1 1 1 1 1 II II 1 M 1 M 1 1 M 1 1 1 1 1 1 1 II Mill M II 1 II 1 1 II 

CCGTGGCGGGAGCTGCAGCTTCACGACTGGATGTCGGAGGAGTACGCGGACTTGAGAGAT 


485 


Db 


432 


491 


Qy 


486 


CCCTGGCTGAAGCTCACCGGCTTCCCCTGCTCCTGGACCTTCTTCCACCACCTCCGGGAG 

II 1 1 II II 1 1 1 1 1 M M 1 1 M 1 I 1 1 1 1 Mill 1 M 1 1 1 II 1 II 1 II 1 1 II 1 

CCTTTCCTGAAGCTCTCTGGCTTCCCCTGCTCTTGGACTTTCTTCCACCATCTCCGGGAA 


545 


Db 


492 


551 


Qy 


546 


CAGATCCGCAGCGAGTTCACCCTGCACGACCACCTTCGGCAAGAGGCCCAGGGGGTACTG 

1 1 1 1 1 1 1 M M 1 1 1 1 II M 1 1 1 1 II 1 1 II 1 1 1 M 1 II 1 1 II 1 1 M III 1 II III 

CAGATCCGCAGAGAGTTCACCCTGCACGACCACCTTCGGGAAGAGGCGCAGAGTGTGCTG 


605 


Db 


552 


611 


Qy 


606 


AGTCAGTTCCGTCTACCCCGCACAGGGGACCGCCCCAGCACCTTCGTGGGGGTCCACGTG 
1 1 1 1 1 1 1 1 1 II 1 1 1 M II II 1 1 1 1 1 1 1 1 1 II 1 1 II 1 II II 1 1 II 1 II 1 1 
GGTCAGCTCCGCCTGGGCCGCACAGGGGACCGCCCGCGCACCTTTGTCGGCGTCCACGTG 


665 


Db 


612 


671 


Qy 


666 


CGCCGCGGGGACTATCTGCGTGTGATGCCCAAGCGCTGGAAGGGGGTGGTGGGTGACGGC 

lllli MMIIII II Mill 1 1 II 1 II 1 1 t II 1 1 1 1 II 1 II 

CGCCGTGGGGACTATCTGCAGGTTATGCCTCAGCGCTGGAAGGGTGTGGTGGGCGACAGC 


725 


Db 


672 


731 


Qy 


726 


CGTTACCTCCAGCAGGCTATGGACTGGTTCCGGGCCCGATACGAAGCCCCCGTCTTTGTG 
t 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M II 1 1 1 M II II II II II 1 1 1 1 II t 1 1 II 1 1 1 
GCCTACCTCCGGCAGGCCATGGACTGGTTCCGGGCACGGCACGAAGCCCCCGTTTTCGTG 


785 


Db 


732 


791 


Qy 


786 


GTCACCAGCAACGGCATGGAGTGGTGCCGGAAGAACATCGACACCTCCCGGGGGGACGTG 

INIINNI Ill 1 IIIIMIIIIIII M III 

GTCACCAGCAACGGCATGGAGTGGTGTAAAGAAAACATCGACACCTCCCAGGGCGATGTG 


845 


Db 


792 


851 


Qy 


846 


ATCTTTGCTGGCGATGGGCGGGAGGCCGCGCCCGCCAGGGACTTTGCGCTGCTGGTGCAG 

1 III MM 1 IMIII III 1 1 M Mill Ml 

ACGTTTGCTGGCGATGGACAGGAGGCTACACCGTGGAAAGACTTTGCCCTGCTCACACAG 


905 


Db 


852 


911 


Qy 


906 


TGCAACCACACCATCATGACCATTGGCACCTTCGGCTTCTGGGCCGCCTACCTGGCTGGT 

INIIIIIMIIil II II M 1 t 1 II II 1 II II M 1 II 1 MMMIIMIM 

TGCAACCACACCATTATGACCATTGGCACCTTCGGCTTCTGGGCTGCCTACCTGGCTGGC 


965 


Db 


912 


971 


Qy 


966 


GGAGATACCATCTACTTGGCTAACTTCACCCTGCCCACTTCCAGCTTCCTGAAGATCTTT 

1 1 1 II 1 1 M 1 II II 1 1 II II 1 II II 1 1 1 II II M 1 M II II M II II 

GGAGACACTGTCTACCTGGCCAACTTCACCCTGCCAGACTCTGAGTTCCTGAAGATCTTT 


1025 


Db ■ 


972 


1031 


Qy 

Db 


1026 
1032 


AAACCCGAGGCTGCCTTCCTGCCCGAGTGGGTGGGCATTAATGCAGACTTGTCTCCACTC 

II II IIIN MM 1 11 II II 1 II 1 M M 1 II 1 II II II II 

AAGCCGGAGGCGGCCTTCCTGCCCGAGTGGGTGGGCATTAATGCAGACTTGTCTCCACTC 


1085 
1091 


Qy 


1086 


CAGATGTTGGCTGGGCCTTGAACCAGCCAGGAGCCTTTCTGGAATAGCCTCGGTCAACCC 

II IIIIM M IMIII II Ml 1 III 1 

TGGACATTGGCTAAGCCTTGA— GAGCCAGGGAGACTTTCTGAAGTAGCCTGATCTTTCT 


1145 


Db 


1092 


1149 


Qy 


1146 


AGGGCCAGCGTTATGGGTCTCCGGA 1170 
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I I I I I I I I I I I I I I I I I 
Db 1150 AGAGCCAGCAGTACGTGGCTTCAGA 1174 



RESULT 14 
AAA53820 

ID AAA53820 standard; cDNA to mRNA; 1174 BP. 
XX 

AC AAA53820; 
XX 

DT 03-JAN'2001 (first entry) 
XX 

DE Human H-transf erase coding sequence. 
XX 

KW mesenchymal stem cell; stem cell; H-tranf erase; glycosyltransf erase; 

KW transplantation; xenotransplantation; transgenic animal; tissue injury; 

KW bone; joint; tendon; ligament; trauma; tumour infection; 

KW muscular dystrophy; osteoarthritis; rheumatoid arthritis; ds . 

XX 

OS Homo sapiens . 
XX 

FH Key Location/Qualifiers 

FT CDS 15. .1112 

FT /*tag- a 

FT /product= "H-transferase" 
XX 

PN WO200049136-A1. 
XX 

PD 24-AUG-20bO. 
XX 

PF 16-FEB-2000; 2000WO-US003 963 . 
XX 

PR 17-FEB-1999; 9 9US-01204 52P . 
XX 

PA (ussu ) US surgical: 

XX 

PI Gruskin EA, Tawil N, Geis D; 
XX 

DR WPI; 2000-533106/48. 

DR P-PSDB; AAY97279. 
XX 

PT Isolated mesenchymal stem cell comprising nucleic acid encoding an immune 

PT system suppressor polypeptide, useful for treatment of tissue injuries 

PT and/or tissue disorders such as bone and joint fractures. 
XX 
PS 
XX 

CC New methods are described for treating a tissue in need of repair or 

CC reconstruction comprising administering a composition comprising a 

CC mesenchymal stem cell where the stem cell differentiates into a cell 

CC which is normally indigenous to the tissue. The mesenchymal stem cell 

CC does not trigger an immune response in the recipient due to it being 

CC transfected with a vector comprising nucleic acid which encodes for an 

CC immune system repressor polypeptide. The mesenchymal stem cell may be 

removed from the recepient; transfected and then replaced or derived from 
a donor transgenic animal. The transfected mesenchymal, stem cells are 



CC 
CC 



CC 



Example 4; Page 40-41; 60pp; English. 



CC useful for treatment of tissue injuries and/or tissue disorders such as 



bone and joint fractures, bone defects resulting from trauma, tumour 
CC infection, tendon and ligament defects, congenital defects, muscular 
CC dystrophy, osteoarthritis, and rheumatoid arthritis. The use of the 
CC genetically engineered cells is advantageous because the need for 
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ov^WAVjj, ocoicii xvcauita i^cuuia lui r^yyn\^auun uj^o^-r^uo miu ocan^ii xvcauii ua-u^-o-t-r... 



CC 

cc 

CC 



allogenic human donors to provide non-immunogenic cells is eliminated. 
Suitable proteins or enzymes that when expressed in xenogenic cells are 
capable of reducing the expression of xenogenic antigens and thus reduce 
CC or inhibit rejection of the xenogenic cell when transplanted include 
CC glycosyltransferases such as human alpha-1, 2-fucosyltransf erase (H- 
CC transferase) and human lysosomal alpha gaiactosidase 
XX 

SQ Sequence 1174 BP; 202 A; 388 C; 328 G; 256 T; 0 U; 0 Other; 

Query Match 62.6%; Score 794.2; DB3; Length 1174; 

Best Local Similarity 80.7%; Pred. No. l.le-174; 

Matches 940; Conservative 0; Mismatches 223; Indels 2; Gaps 1; 



Qy 


6 


GCCATGTGGGTCCCCAGCCGCCGCCACCTCTGTCTGACCTTCCTGCTAGTCTGTGTTTTA 

MM III 1 M MM! Ml IMMIMIIIMIIMII 1 

GCC ATGTGGC TCCGGAGCCATCGTCAGC TP Tar r Tarir r T Trr T" r-r" rr 7\ r- ^Pr- rnr- m m/^ 


, 65 


Db 


12 


71 


Qy 


66 


GCAGCAATTTTCTTCCTGAACGTCTATCAAGACCTCTTTTACAGTGGCTTAGACCTGCTG 
1 1 1 N 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 


125 


Db 


72 


131 


Qy 


126 


GCCCTGTGTCCAGACCATAACGTGGTATCATCTCCCGTGGCCATATTCTGCCTGGCGGGC 
1 N 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 II II 1 1 1 II 1 II 1 1 1 1 1 1 

ATCCTGTGTCCAGACCGCPGPPTGGTf^npapprrrarTrr'r«n7\air*mair«m/^r«/-^m/-r^«^«^m 


185 


Db 


132 


191 


Qy 


186 


ACGCCGGTACACCCCAACGCCTCCGATTCCTGTCCCAAGCATCCTGCCTCCTTTTCCGGG 
> 1 III 1 1 1 1 1 1 II 1 1 1 1 1 1 II M 1 1 1 1 1 1 1 1 1 

ACT GC GAT GGGCC CC AAC GC C TP P TP T T P P Tr; TP r r r* n r- r n r- r- r- n m m 


245 


Db 


192 


251 


Qy 


246 


ACCTGGACTATTTACCCGGATGGCCGGTTTGGGAACCAGATGGGACAGTATGCCACGCTG 

1 1 N 1 1 1 1 1 II II II 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

ACC TGGAC TGT C T AC C C CAAT GGC C GGT TTGGT A A T P a r:;a Tfzr r n r n r T n qi r- r- 7\ r- rp/- 


305 


Db 


252 


311 


Qy 


306 


CTGGCCCTGGCGCAGCTCAACGGCCGCCAGGCCTTCATCCAGCCTGCCATGCACGCCGTC 
III II II ill III III Mil 1 Ill nil 1 

CTGGCTCTGGCCCAGCTCAACGGCCGCCGGGCCTTT ATP PTapPTPPP a TrrnTnnr'/-nn 


365 


Db 


312 


371 


Qy 


366 


CTGGCCCCCGTGTTCCGCATCACGCTGCCTGTCCTGGCGCCCGAGGTAGACAGGCACGCT 

M Mill MM M Mill II M M 1 I 

CTGGCCCCGGTATTCCGCATCACCCTGCCCGTGCTGGCCCCAGAAGTaaapappprrarr 


425 


Db 


372 


431 


Qy 


426 


CCTTGGCGGGAGCTGGAGCTTCACGACTGGATGTCCGAGGATTATGCCCACTTAAAGGAG 
II 11 11 1 1 1 11 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II II 1 1 1 1 1 II 

I. i utrC trtrGAGC. i GC AGC T TCAC GAC T GGAT GT C GGAGGAGT ACGC GGAC T T GAGAGAT 


/IOC 


Db 


432 


491 


Qy 


486 


CCCTGGCTGAAGCTCACCGGCTTCCCCTGCTCCTGGACCTTCTTCCACCACCTCCGGGAG 

Ml MMIIMI 1 MlillMIIIIII IIIM IIMIIIIIM llllllll 

CCTTTCCTGAAGCTCTCTGGCTTCCCCTGCTCTTGGACTTTCTTCCACCATCTCCGGGAA 


545 


Db 


492 


551 


Qy 


546 


CAGATCCGCAGCGAGTTCACCCTGCACGACCACCTTCGGCAAGAGGCCCAGGGGGTACTG 

Mill IIIIMM MIIIMIIIIMI II 1 Ml 

CAGATCCGCAGAGAGTTCACCCTGCACGACCACCTTCGGGAAGAGGCGCAGAGTGTGCTG 


605 


Db 


552 


611 


Qy 


606 


AGTCAGTTCCGTCTACCCCGCACAGGGGACCGCCCCAGCACCTTCGTGGGGGTCCACGTG 

1 Illlllllllll IMIIM II II IMIIIIM 

GGTCAGCTCCGCCTGGGCCGCACAGGGGACCGCCCGCGCACCTTTGTCGGCGTCCACGTG 


665 


Db 


612 


671 


Qy 


666 


CGCCGCGGGGACTATCTGCGTGTGATGCCCAAGCGCTGGAAGGGGGTGGTGGGTGACGGC 

IIIM III MM II Mill Mllll Mlllll 

CGCCGTGGGGACTATCTGCAGGTTATGCCTCAGCGCTGGAAGGGTGTGGTGGGCGACAGC 


725 


Db 


672 


731 
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o^\jr\jj, ocoi^n xvcdUiLd j^doiia lkji /^ppiiL/auuii u:70^n^uo oiiu ocan^ii Avcauii ua-ujy-o^T.., A'a^c jo 



I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I M I I I II I I I I I II Ml 



I I I M I I II I I I I I I I I I II M I I 11 I II I I I I I I I M I II I I III II III 



I I I M I II I I I I I II I II I II I III I I II I II I I I I M I II 



I I I II M I I I I I I I I I I II I II I I I II II I I I II I I I I I M I I I M I I I I I I I I I I I 



I I I M M I I I II I II I I I II I I I I I I I I I I M I I II I I I I I I I M II 

GGAGACACTGTCTACCTGGCCAACTTCACCCTGCCAGACTCTGAGTTCCTGAAGATCTTT 1031 



II M I I I I I I I I I I I I I I I I I I I I I I M I I I II I I I I I I I I I I I I M I II M I I I I I 



II I I I I I I 11 I I I I I I I I I I I I II IN I I II 



I I I I I I I I I I I I I I I I I 



RESULT 15 
AAT63575 

ID AAT63575 standard; DNA; 3791 BP. 
XX 

AC AAT63575; 
XX 

DT Ol-JUL-1997 (first entry) 
XX 

DE Chicken beta-actin promoter. 
XX 
KW 
KW 



Qy 


726 


Db 


732 


Qy 


786 


Db 


792 


Qy 


846 


Db 


852 


Qy 


906 


Db 


912 


Qy 


966 


Db 


972 


Qy 


1026 


Db 


1032 


Qy 


1086 


Db 


1092 


Qy 


1146 


Db 


1150 



Xenotransplantation; organ transplant; transgenic animal; transgenic pig; 
transgenic mouse; antibody mediated rejection; hyperacute rejection; 
KW antigen reducing enzyme; alpha (1, 2 ) fucosyltransf erase; beta-actin; 
KW promoter; ss . 
XX 

OS Gallus sp. 
XX 

PN WO9712035-A2. 
XX 

PD 03-APR-1997. 
XX 

PF 23-SEP-1996; 96WO-US015255 . 
XX 

PR 27-SEP-1995; 95US-0004461P. 
PR 03-JUL-1996; 96US-00675773 . 
XX 

PA (NEXT-) NEXTRAN. 
XX 
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PI Diamond LE, Logan JS, Byrne GW, Sharma A; 
XX 

DR WPI; 1997-225881/20. 
XX 

FT Transgenic animals expressing antigen reducing enzyme and complement 

PT inhibitor - used for production of materials suitable for human 

PT transplantation having a reduced risk of rejection. 
XX 

PS Example 6.1; Page 112-115; 146pp; English. 
XX 

CC The chicken beta-actin promoter (AAT63575) and the H2Kb gene promoter 

CC (AAT63576) can be utilised in novel vectors for the expression of gal 

CC epitope-reducing enzymes (e.g. human alpha (1, 2 ) fucosyl transferase) and 

CC complement inhibitors (e.g. CD59) on the endothelial cells of transgenic 

CC animals such as pigs and mice. The transgenic animals are used for the 

CC prodn. of materials suitable for transplantation to humans, such 

CC materials being less likely to produce an antibody-mediated reiection 
XX 

SQ Sequence 3791 BP; 825 A; 1092 C; 1036 G; 838 T; 0 U; 0 Other; 

Query Match 62.4%; Score 792.2; DB 2; Length 3791; 

Best Local Similarity 80.3%; Pred. No. 4.2e-174; 

Matches 929; Conservative 0; Mismatches 228; Indels 0; Gaps 0; 



Qy 


6 


GCCATGTGGGTCCCCAGCCGCCGCCACCTCTGTCTGACCTTCCTGCTAGTCTGTGTTTTA 

IIIM II MM M II Mill III IIIIIIIIIIIIIIIIIII 1 

GCCATGTGGCTCCGGAGCGATCGTCAGCTCTGCCTGGCCTTCCTGCTAGTCTGTGTCCTC 


65 


Db 


2056 


2115 


Qy 


66 


GCAGCAATTTTCTTCCTGAACGTCTATCAAGACCTCTTTTACAGTGGCTTAGACCTGCTG 

1 1 M 1 II II M II 1 II M M II M MM M 1 1 M 1 1 II 1 1 

TCTGTAATCTTCTTCCTCCATATCCATCAAGACAGCTTTCCACATGGCCTAGGCCTGTCG 


125 


Db 


2116 


2175 


Qy 


126 


GCCCTGTGTCCAGACCATAACGTGGTATCATCTCCCGTGGCCATATTCTGCCTGGCGGGC 

> > I > 1 1 1 1 1 M II 1 1 MM II I M II II 1 M 1 1 M II 1 M 1 MM 
ATCCTGTGTCCAGACCGCCGCCTGGTGACACCCCCAGTGGCCATCTTCTGCCTGCCGGGT 


185 


Db 


2176 


2235 


Qy 


186 


ACGCCGGTACACCCCAACGCCTCCGATTCCTGTCCCAAGCATCCTGCCTCCTTTTCCGGG 
It III II 1 II M 1 II II 1 M II M II M II II II II 1 II 1 1 It II 1 
ACTGCGATGGGCCCCAACGCCTCCTCTTCCTGTCCCCAGCACCCTGCTTCCCTCTCCGGC 


245 


Db 


2236 


2295 


Qy 


246 


ACCTGGACTATTTACCCGGATGGCCGGTTTGGGAACCAGATGGGACAGTATGCCACGCTG 

lllllll 1 MMIIIIM II MMMMMMMMMMIMI 

ACCTGGACTGTCTACCCCAATGGCCGGTTTGGTAATCAGATGGGACAGTATGCCACGCTG 


305 


Db 


2296 


2355 


Qy 


306 


CTGGCCCTGGCGCAGCTCAACGGCCGCCAGGCCTTCATCCAGCCTGCCATGCACGCCGTC 

1 1 1 1 1 Mill II M II 1 II II I II II MUM Mil M 1 II II II II 1 II II 1 

CTGGCTCTGGCCCAGCTCAACGGCCGCCGGGCCTTTATCCTGCCTGCCATGCATGCCGCC 


365 


Db 


2356 


2415 


Qy 


366. 


CTGGCCCCCGTGTTCCGCATCACGCTGCCTGTCCTGGCGCCCGAGGTAGACAGGCACGCT 

M II IIIIIIIIMI Mill II Mill- II II tl II 

CTGGCCCCGGTATTCCGCATCACCCTGCCCGTGCTGGCCCCAGAAGTGGACAGCCGCACG 


425 


Db 


2416 


2475 


Qy 


426 


CCTTGGCGGGAGCTGGAGCTTCACGACTGGATGTCCGAGGATTATGCCCACTTAAAGGAG 

MIMIM lllllll II M II Mill II 

CCGTGGCGGGAGCTGCAGCTTCACGACTGGATGTCGGAGGAGTACGCGGACTTGAGAGAT 


485 


Db 


2476 


2535 


Qy 


486 


CCCTGGCTGAAGCTCACCGGCTTCCCCTGCTCCTGGACCTTCTTCCACCACCTCCGGGAG 

II 1 MIIIMII 1 IIIMIIItlllM IIIM IMIIIIIIII MIMIM 
CCTTTCCTGAAGCTCTCTGGCTTCCCCTGCTCTTGGACTTTCTTCCACCATCTCCGGGAA 


545 


Db 


2536 


2595 


Qy 


546 


CAGATCCGCAGCGAGTTCACCCTGCACGACCACCTTCGGCAAGAGGCCCAGGGGGTACTG 


605 
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o^vyivi:, ocan^ii Avcauita iycLcuia nji j-\.yyiiK,auuu \jyo^-^^\jo auu ocai xvcauii u:>-\jy-i 



Db 


2596 


1 M 1 1 1 1 M 1 1 i 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 III 1 II III 

CAGATCCGCAGAGAGTTCACCCTGCACGACCACCTTCGGGAAGAGGCGCAGAGTGTGCTG 


2655 


Qy 


606 


AGTCAGTTCCGTCTACCCCGCACAGGGGACCGCCCCAGCACCTTCGTGGGGGTCCACGTG 

II IN III! II 1 1 1 1 M 1 II 1 1 II M M 1 II III II II II II III INI 

GGTCAGCTCCGCCTGGGCCGCACAGGGGACCGCCCGCGCACCTTTGTCGGCGTCCACGTG 


665 


Db 


2656 


2715 


Qy 


666 


CGCCGCGGGGACTATCTGCGTGTGATGCCCAAGCGCTGGAAGGGGGTGGTGGGTGACGGC 

1 1 1 1 1 M 1 1 II 1 1 1 1 II 1 II 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 III II 

CGCCGTGGGGACTATCTGCAGGTTATGCCTCAGCGCTGGAAGGGTGTGGTGGGCGACAGC 


725 


Db 


2716 


2775 


Qy 


726 


CGTTACCTCCAGCAGGCTATGGACTGGTTCCGGGCCCGATACGAAGCCCCCGTCTTTGTG 
1 1 1 1 1 M 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 M 1 1 1 1 1 1 1 1 II III 
GCCTACCTCCGGCAGGCCATGGACTGGTTCCGGGCACGGCACGAAGCCCCCGTTTTCGTG 


785 


Db 


2776 


2835 


Qy 


786 


GTCACCAGCAACGGCATGGAGTGGTGCCGGAAGAACATCGACACCTCCCGGGGGGACGTG 

HUM 1 Illllll 1 llllllllllllllll III II III 

GTCACCAGCAACGGCATGGAGTGGTGTAAAGAAAACATCGACACCTCCCAGGGCGATGTG 


845 


Db 


2836 


2895 


Qy 


846 


ATCTTTGCTGGCGATGGGCGGGAGGCCGCGCCCGCCAGGGACTTTGCGCTGCTGGTGCAG 

1 II Illllll 1 IMIII 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 III 

ACGTTTGCTGGCGATGGACAGGAGGCTACACCGTGGAAAGACTTTGCCCTGCTCACACAG 


905 


Db 


2896 


2955 


Qy 


906 


TGCAACCACACCATCATGACCATTGGCACCTTCGGCTTCTGGGCCGCCTACCTGGCTGGT 
1 1 1 1 1 M 1 1 1 M I 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 
TGCAACCACACCATTATGACCATTGGCACCTTCGGCTTCTGGGCTGCCTACCTGGCTGGC 


965 


Db 


2956 


3015 


Qy 


966 


GGAGATACCATCTACTTGGCTAACTTCACCCTGCCCACTTCCAGCTTCCTGAAGATCTTT 

Illllll 1 1 1 1 1 1 1 1 1 1 t 1 1 II 1 1 1 II 1 11 1 1 1 1 1 1 1 1 1 1 1 1 

GGAGACACTGTCTACCTGGCCAACTTCACCCTGCCAGACTCTGAGTTCCTGAAGATCTTT 


1025 


Db 


3016 


3075 


Qy 


1026 


AAACCCGAGGCTGCCTTCCTGCCCGAGTGGGTGGGCATTAATGCAGACTTGTCTCCACTC 
N II 1 II 1 1 1 1 1 1 1 1 1 1 i 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 
AAGCCGGAGGCGGCCTTCCTGCCCGAGTGGGTGGGCATTAATGCAGACTTGTCTCCACTC 


1085 


Db 


3076 


3135 


Qy 


1086 


CAGATGTTGGCTGGGCCTTGAACCAGCCAGGAGCCTTTCTGGAATAGCCTCGGTCAACCC 
II MINI 1 1 1 1 1 I.I 1 1 1 1 1 1 1 1 1 II II 1 1 
TGGACATTGGCTAAGCCTTGAGTCGACGTGCAGGCATGCAAGCTTCGGGTGGACCCGGTC 


1145 


Db 


3136 


3195 


Qy 


1146 


AGGGCCAGCGTTATGGG 1162 
1 II III II 
AACTTCAAGGTGAGCGG 3212 




Db 


3196 





Search completed: November 14, 2006, 05:04:31 
Job time : 890 sees 



SCORE 1.3 BuildDate: 12/06/2005 



http://es/ScoreAccessWeb/GetItem.action?AppId=09844268&seqId=8248 1 3&ItemName=u.. . 1 1/1 5/06 



SCORE Search Results Details for Application 09 
and Search Result us-09-844-268-12.rnpb 

Score Home Page Retrieve Application List SCORE System Overview SCORE FAQ Comments / Suqq 

This page gives you Searcli Results detail for the Application 09844268 and Search Result us-09-8^ 
start 

Go Bac 

GenCore version 5.1.9 
Copyright (c) 1993 - 2006 Biocceleration Ltd. 



OM nucleic - nucleic search, using sw model 



Run on: 



Title: 

Perfect score: 
Sequence : 

Scoring table: 



November 13, 2006, 21:00:25 ; Search time 1815 Seconds 

(without alignments) 
8591.190 Million cell updates/sec 

US-09-844-268-12 
1269 

1 ctcgagccatgtgggtcccc actcagaggaaaccctgcag 1269 

IDENTITY_NUC 

Gapop 10.0 , Gapext 1.0 



Searched: 



18892170 seqs, 6143817638 residues 



Total number of hits satisfying chosen parameters: 

Minimum DB seq length: 0 

Maximum DB seq length: 2000000000 



37784340 



Post-processing : 



Database 



Minimum Match 0% 
Maximum Match 100% 
Listing first 45 summaries 

Published_Applications_NA_Main : 



1 
2 
3 
4 
5 
6 
7 
8 
9 

10 
11 
12 
13 
14 
15 
16 



/EMC_Celerra_SIDS3/ptodata/2/pubpna/US07_PUBCOMB.s 
/EMC_Celerra_SIDS3/ptodata/2/pubpna/US08_PUBCOMB.s 
/EMC_Celer r a_S IDS 3 /ptoda ta /2 /pubpna/US 0 9A_PUBC0MB 
/EMC_Celer r a_S IDS 3 /ptoda ta /2 /pubpna/US 0 9B_PUBC0MB 
/EMC_Celerra_SIDS3/ptodata/2/pubpna/US09C__PUBCOMB 
/EMC_Celerra__SIDS3/ptodata/2/pubpna/US10A_PUBCOMB 
/EMC_Celerra_SIDS3/ptodata/2/pubpna/US10B_PUBCOMB 
/EMC_Celerra_SIDS3/ptodata/2/pubpna/US10C__PUBCOMB 
/EMC_Celerra_SIDS3/ptodata/2/pubpna/US10D_PUBCOMB 
/EMC_Celerra_SIDS3/ptodata/2/pubpna/US10E_PUBCOMB 
/EMC_Celerra__SIDS3/ptodata/2/pubpna/US10F_PUBCOMB 
/EMC_Celerra_SIDS3/ptodata/2/pubpna/US10G_PUBCOMB 
/EMC_Celerra_SIDS3/ptodata/2/pubpna/USllA_PUBCOMB 
/EMC_Celerra_SIDS3/ptodata/2/pubpna/USllB_PUBCOMB 
/EMC_Celerra_SIDS3/ptodata/2/pubpna/USllC_PUBCOMB 
/EMC_Celerra_SIDS3/ptodata/2/pubpna/USllD_PUBCOMB 



eq: * 
eq: * 
seq: * 
seq: * 
seq: * 
seq: * 
seq: * 
seq: * 
seq: * 
. seq: * 
. seq: * 
. seq: * 
. seq: * 
.seq:* 
. seq: * 
. seq: * 



Fred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
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and is derived by analysis of the total score distribution. 



SUMMARIES 



Result 



% 

Query 





No. 


Score 


Match 


Length 


DB 


ID 




1 


1269 


1 no 




1269 


3 


US-09-844-268-12 




2 


1269 


X \J L/ 


n 


1269 


3 


US-09-844-705-12 




3 


1094.8 


0 U 


0 
. J 


1098 


3 


US-09-051-034A-3 




4 


808. 6 


00 


7 


8174 


3 


US-09-863-475A-5 




5 


806.8 


00 


. 0 


3373 


6 


US-10-105-963-9 . 




6 


806.8 


DO 


. 0 


3373 


16 


US-11-219-419-9 




7 


454.2 




p 


1043 


3 


US-09-051-034A-1 




8 


454.2 


JO 


Q 


1043 


3 


US-09-254-077A-5 




9 


449.4 


0 D 


A 

. 4 


3088 


3 


US-09-954-456-45 




10 


449.4 




/I 
. fl 


3088 


3 


US-09-954-456-1621 




11 


449.4 


•J D 


/I 


3088 


3 


US-09-969-347-234 




12 


449.4 


00 


A 


3088 


10 


US-10-843-641A-3072 




13 


449.4 


OO 


/I 


3088 


10 


US-10-843-641A-4 648 




14 


449.4 


0 0 


/i 
. H 


3088 


10 


US-10-843-641A-8363 




15 


449.4 


'3 c; 


A 
. fl 


3088 


10 


US-10-756-149-275 




16 


. 423.8 


JO 


/I 


1068 


3 


US-09-999-672-9 




17 


423.8 


JO 


/I 


1068 


6 


US-10-040-863-9 




18 


423.8 


0 J , 


A 

. H 


1149 


3 


US-09-999-672-7 




19 


423.8 


0 J < 


A 

» H 


1149 


6 


US-10-040-863-7 




20 


415.2 


oZ . 


-7 


556 


7 


US-10-029-386-3291 


c 


21 


399 


'3 1 
Jl . 


A 

. H 


954 


7 


US-10-02 9-386-22 996 




22 


389.6 


oU . 


-7 


1742 


16 


US-11-136-527-2562 




23 


325.2 


0 

Z o . 


. D 


434 


7 


US-10-029-386-16991 




24 


172.6 


-Lo . 


. b 


607 


4 


US-09-925-065A-737 666 




25 


172.6 


1 "3 
X J . 


D 


607 


5 


US-09-925-065A-737 666 


c 


26 


129.4 


1 n 
XU . 




509 


7 


US-10-029-386-9295 




27 


103.6 


Q 




600 


16 


US-11-136-527-6658 


c 


28 


86 


a. 


0 
0 


620 


4 


US-09-925-065A-56199 


c 


29 


86 


D . 


Q 


620 


5 


US-09-925-065A-56199 


c 


30 


86 


0 . 


Q 
0 


620 


12 


US-10-301-480-157437 


c 


31 


86 


D . 


Q 


620 


12 


US-10-301-480-77084 6 




32 


74.4 




q 


100 


3 


US-09-999-672-23 




33 


74.4 


c 

0 . 


q 


100 


3 


US-09-999-672-26 




34 


74.4 




q 


100 


6 


US-10-040-863-23 




35 


74.4 


5 . 


q 


100 


6 


US-10-040-863-26 




36 


71.2 


5. 


6 


100 


3 


US-09-999-672-14 




37 


71.2 


5. 


6 


100 


3 


US-09-999-672-17 




38 


71.2 


5. 


6 


100 


6 


US-10-040-863-14 




39 


71.2 


5. 


6 


100 


6 


US-10-040-863-17 




40 


69.6 


5. 


5 


100 


3 


US-09-999-672-29 




41 


69.6 


5. 


5 


100 


6 


US-10-040-863-29 




42 


66.4 


5. 


2 


100 


3 


US-09-999-672-20 




43 


66.4 


5. 


2 


100 


6 


US-10-040-863-20 




44 


49.2 


3. 


9 


1065 


7 


US-10-369-493-42988 




45 


48.6 


3. 


8 


7788 


7 


US-10-329-079-8 
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Sequence 12, Appl 
Sequence 3, Appli 
Sequence 5, Appli 
Sequence 9, Appli 

Sequence 9, Appli 
Sequence 1, Appli 
Sequence 5, Appli 
Sequence 45, Appl 
Sequence 1621, Ap 
Sequence 234, App 
Sequence 3072, Ap 
Sequence 4648, Ap 
Sequence 8363, Ap 
Sequence 275, App 
Sequence 9, Appli 
Sequence 9, Appli 
Sequence 7, Appli 
Sequence 7, Appli 
Sequence 3291, Ap 
Sequence 22996, A 

Sequence 2562, Ap 
Sequence 16991, A 
Sequence 737666, 
Sequence 737666, 
Sequence 9295, Ap 

Sequence 6658, Ap 
Sequence 56199, A 
Sequence 56199, A 
Sequence 157437, 
Sequence 770846, 
Sequence 23, Appl 
Sequence 26, Appl 
Sequence 23, Appl 
Sequence 2 6, Appl 
Sequence 14, Appl 
Sequence 17, Appl 
Sequence 14, Appl 
Sequence 17, Appl 
Sequence 29, Appl 
Sequence 29, Appl 
Sequence 20, Appl 
Sequence 20, Appl 
Sequence 42988, A 
Sequence 8, Appli 



ALIGNMENTS 



RESULT 1 

US-09-844-268-12 

; Sequence 12, /^plication US/09844268 
; Patent No. US20020129395A1 
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GENERAL INFORMATION: 
APPLICANT: BOSWORTH, BRAD 
APPLICANT : VOGELI , PETER 

TITLE OF INVENTION: METHODS AND COMPOSITIONS TO IDENTIFY SWINE GENETICALLY 
TITLE OF INVENTION: RESISTANT TO F18 E. COLI ASSOCIATED DISEASES 
FILE REFERENCE: 21419/90368 
CURRENT APPLICATION NUMBER: US/09/844,268 
CURRENT FILING DATE: 2001-04-27 
PRIOR APPLICATION NUMBER: 09/443,766 
PRIOR FILING DATE: 1999-11-19 
NUMBER OF SEQ ID NOS : 13 
SOFTWARE: PatentIn Ver . 2.0 
SEQ ID NO 12 
LENGTH: 1269 
TYPE: DNA 
ORGANISM: Porcine 
FEATURE : 
NAME/KEY: CDS 
LOCATION: (9) . . (1103) 
US-09-844-268-12 

Query Match 100.0%; Score 1269; DB 3; Length 1269; 

Best Local Similarity 100.0%; Pred. No. 0; 

Matches 1269; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 

Qy 1 CTCGAGCCATGTGGGTCCCCAGCCGCCGCCACCTCTGTCTGACCTTCCTGCTAGTCTGTG 60 

N M I I I I I I i I i I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I M I I I I I I I I I I 
1 CTCGAGCCATGTGGGTCCCCAGCCGCCGCCACCTCTGTCTGACCTTCCTGCTAGTCTGTG 60 

Qy 61 TTTTAGCAGCAATTTTCTTCCTGAACGTCTATCAAGACCTCTTTTACAGTGGCTTAGACC 120 

I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I { I I I I I I I I I I I 

Db 61 TTTTAGCAGCAATTTTCTTCCTGAACGTCTATCAAGACCTCTTTTACAGTGGCTTAGACC 120 

Qy 121 TGCTGGCCCTGTGTCCAGACCATAACGTGGTATCATCTCCCGTGGCCATATTCTGCCTGG 180 

lIMM I I I I I I I I I I I I I I I I I I I I I I I I I IIIIIIMIIIMI 

Db 121 TGCTGGCCCTGTGTCCAGACCATAACGTGGTATCATCTCCCGTGGCCATATTCTGCCTGG 180 

Qy 181 CGGGCACGCCGGTACACCCCAACGCCTCCGATTCCTGTCCCAAGCATCCTGCCTCCTTTT 240 

I I I I I > I I I M I I t I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I J I I I I I I I I I I I I I 

Db 181 CGGGCACGCCGGTACACCCCAACGCCTCCGATTCCTGTCCCAAGCATCCTGCCTCCTTTT 240 

Qy 241 CCGGGACCTGGACTATTTACCCGGATGGCCGGTTTGGGAACCAGATGGGACAGTATGCCA 300 

iliiMIMIIItllllillllllll lllllilM lllllltllllMI 

Db .241 CCGGGACCTGGACTATTTACCCGGATGGCCGGTTTGGGAACCAGATGGGACAGTATGCCA 300 

Qy 301 CGCTGCTGGCCCTGGCGCAGCTCAACGGCCGCCAGGCCTTCATCCAGCCTGCCATGCACG 360 

> I I I I I I I I I I I I I I.I I I I I I I I I M I I I I I I I I I I I { I I I I M I I I I I I I t I I i I I I I I 
Db 301 CGCTGCTGGCCCTGGCGCAGCTCAACGGCCGCCAGGCCTTCATCCAGCCTGCCATGCACG 360 

Qy 361 CCGTCCTGGCCCCCGTGTTCCGCATCACGCTGCCTGTCCTGGCGCCCGAGGTAGACAGGC 420 

NIIMMIIIIIIIIIMIIIIII Illllllllllllllllllllllilllll 

Db 361 CCGTCCTGGCCCCCGTGTTCCGCATCACGCTGCCTGTCCTGGCGCCCGAGGTAGACAGGC 420 

Qy 421 ACGCTCCTTGGCGGGAGCTGGAGCTTCACGACTGGATGTCCGAGGATTATGCCCACTTAA 480 

IIMIIIMIIIIIIIIII IIIIIIMIIIIIIIItllllllllll MM 

Db 421 ACGCTCCTTGGCGGGAGCTGGAGCTTCACGACTGGATGTCCGAGGATTATGCCCACTTAA 480 

Qy 481 AGGAGCCCTGGCTGAAGCTCACCGGCTTCCCCTGCTCCTGGACCTTCTTCCACCACCTCC 540 

I I < I I I I I I M I M I I II I I II I I M I II II I I II I II I I II M II II I II I II I 

Db 481 AGGAGCCCTGGCTGAAGCTCACCGGCTTCCCCTGCTCCTGGACCTTCTTCCACCACCTCC 540 
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Qy 541 ,GGGAGCAGATCCGCAGCGAGTTCACCCTGCACGACCACCTTCGGCAAGAGGCCCAGGGGG 600 

IIIIIIINIIIIMIMIIIIIIIIIIIIMIIIliiiMiiiMilllllllllltll 

Db 541 GGGAGCAGATCCGCAGCGAGTTCACCCTGCACGACCACCTTCGGCAAGAGGCCCAGGGGG 600 

Qy 601 TACTGAGTCAGTTCCGTCTACCCCGCACAGGGGACCGCCCCAGCACCTTCGTGGGGGTCC 660 

mil IIMIIIIIIMIIIIIIIIIIMllilliiiMMIIIIIIIIIIIIII 

Db 601 TACTGAGTCAGTTCCGTCTACCCCGCACAGGGGACCGCCCCAGCACCTTCGTGGGGGTCC 660 

Qy 661 ACGTGCGCCGCGGGGACTATCTGCGTGTGATGCCCAAGCGCTGGAAGGGGGTGGTGGGTG 720 

I I I I I I I I I M I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I 

Db 661 ACGTGCGCCGCGGGGACTATCTGCGTGTGATGCCCAAGCGCTGGAAGGGGGTGGTGGGTG 720 

Qy 721 ACGGCCGTTACCTCCAGCAGGCTATGGACTGGTTCCGGGCCCGATACGAAGCCCCCGTCT 780 

III I I I I I I I I I I II I I I I I I I I I I I I I I I I I I M I I I I M I I I I I I I I I I I I 

Db 721 ACGGCCGTTACCTCCAGCAGGCTATGGACTGGTTCCGGGCCCGATACGAAGCCCCCGTCT 780 

Qy 781 TTGTGGTCACCAGCAACGGCATGGAGTGGTGCCGGAAGAACATCGACACCTCCCGGGGGG 840 

I I I I I I M I I I I I I I I M I I I I I I I I I M I I I I I .I I I I I I I I I I M I I I I I I I I I I I I I I 

Db 781 TTGTGGTCACCAGCAACGGCATGGAGTGGTGCCGGAAGAACATCGACACCTCCCGGGGGG 840 

Qy 841 ACGTGATCTTTGCTGGCGATGGGCGGGAGGCCGCGCCCGCCAGGGACTTTGCGCTGCTGG 900 

I I I I I I I I I I I M II I I II I I II I I II II I I I I M I I I I I I I M I I I I I I I II I I I II I I 
Db 841 ACGTGATCTTTGCTGGCGATGGGCGGGAGGCCGCGCCCGCCAGGGACTTTGCGCTGCTGG 900 

Qy 901 TGCAGTGCAACCACACCATCATGACCATTGGCACCTTCGGCTTCTGGGCCGCCTACCTGG 960 

I I I I I I I I I I I I I I I I 11 I I II I I I I I I M I I I I II M M I I I I I M M I I M I I I M M 

Db 901 TGCAGTGCAACCACACCATCATGACCATTGGCACCTTCGGCTTCTGGGCCGCCTACCTGG 960 

Qy 961 CTGGTGGAGATACCATCTACTTGGCTAACTTCACCCTGCCCACTTCCAGCTTCCTGAAGA 1020 

I I I I I I M I I I I I M II II I I I I II I I M I I I I M I I M II I I I I I I I I I I Ml II I I II 

Db 961 CTGGTGGAGATACCATCTACTTGGCTAACTTCACCCTGCCCACTTCCAGCTTCCTGAAGA 1020 

Qy 1021 TCTTTAAACCCGAGGCTGCCTTCCTGCCCGAGTGGGTGGGCATTAATGCAGACTTGTCTC 1080 

I I I I I I I I I I I I I I II I I I I II I I I II I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I M 
Db 1021 TCTTTAAACCCGAGGCTGCCTTCCTGCCCGAGTGGGTGGGCATTAATGCAGACTTGTCTC 1080 

Qy 1081 CACTCCAGATGTTGGCTGGGCCTTGAACCAGCCAGGAGCCTTTCTGGAATAGCCTCGGTC 1140 

I I It I I I I I I I I I I I I I 1. 1 I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I M I I 
Db 1081 CACTCCAGATGTTGGCTGGGCCTTGAACCAGCCAGGAGCCTTTCTGGAATAGCCTCGGTC 1140 

Qy 1141 AACCCAGGGCCAGCGTTATGGGTCTCCGGAAGCCCGAGTAACTTCCGGAGATGCTGGTGG 1200 

I M I I I M I I II I I I I I I I I I I M I I I I I I I I M I I I I I I I I I I I I I I I I I I I M I I I II 

Db 1141 AACCCAGGGCCAGCGTTATGGGTCTCCGGAAGCCCGAGTAACTTCCGGAGATGCTGGTGG 1200 

Qy 1201 TCCTGTAGCAGGCTGGACACTTATTTCAAGAGTGATTCTAATTGGCTGGACTCAGAGGAA 1260 

I II I I I I I I 1 I I I I M I I I I I I I I I M I I I I I I I I I I I II I I M I I I I I M 11 I I I M I I 

Db 1201 TCCTGTAGCAGGCTGGACACTTATTTCAAGAGTGATTCTAATTGGCTGGACTCAGAGGAA 12 60 

Qy 1261 ACCCTGCAG 1269 

I I II I I I I I 
Db 12 61 ACCCTGCAG 1269 

RESULT 2 

US-09-844-705-12 

Sequence 12, Application US/09844705 
Patent No. US20020133836A1 
GENERAL INFORMATION: 
APPLICANT: BOSWORTH, BRAD 
APPLICANT: VOGELI, PETER 

TITLE OF INVENTION: METHODS AND COMPOSITIONS TO IDENTIFY SWINE GENETICALLY 
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o^^vyivij, ocmuii xvcauiLa i^cuuia lui ^ppiic/atiuii kj^o^^^^o oiiu ocan^ii ivcauit ua-w:7-o-r-r-z,... rage ^ vji 

TITLE OF INVENTION: RESISTANT TO F18 E. COLI ASSOCIATED DISEASES 
FILE REFERENCE: 21419/90368 
CURRENT APPLICATION NUMBER: US/09/844,705 
CURRENT FILING DATE: 2001-04-27 
PRIOR APPLICATION NUMBER: 09/443,766 
PRIOR FILING DATE: 1999-11-19 
NUMBER OF SEQ ID NOS : 13 
SOFTWARE: Patentin Ver . 2.0 
SEQ ID NO 12 
LENGTH: 1269 
TYPE: DNA 
ORGANISM: Porcine 
FEATURE : 
NAME/KEY: CDS 
LOCATION: (9) . . (1103) 
US-09-844-705-12 

Query Match 100.0%; Score 1269; DB 3; Length 1269; 

Best Local Similarity 100.0%; Pred. No. 0; 

Matches 1269; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 
Qy 1 CTCGAGCCATGTGGGTCCCCAGCCGCCGCCACCTCTGTCTGACCTTCCTGCTAGTCTGTG 60 

I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I M I I I I I M I I I I I I I I I I I 

Db 1 CTCGAGCCATGTGGGTCCCCAGCCGCCGCCACCTCTGTCTGACCTTCCTGCTAGTCTGTG 60 

Qy 61 TTTTAGCAGCAATTTTCTTCCTGAACGTCTATCAAGACCTCTTTTACAGTGGCTTAGACC 120 

M I I I M I I I I I I I I I I I I I I I I I I M I I M I I I I I I I I I I I I I I I I I I M I M I I I I I I 

Db 61 TTTTAGCAGCAATTTTCTTCCTGAACGTCTATCAAGACCTCTTTTACAGTGGCTTAGACC 120 

Qy 121 TGCTGGCCCTGTGTCCAGACCATAACGTGGTATCATCTCCCGTGGCCATATTCTGCCTGG 180 

I I t I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I i I I I I I I I I I I I I I M I I I I I I I I 
Db 121 TGCTGGCCCTGTGTCCAGACCATAACGTGGTATCATCTCCCGTGGCCATATTCTGCCTGG 180 

Qy 181 CGGGCACGCCGGTACACCCCAACGCCTCCGATTCCTGTCCCAAGCATCCTGCCTCCTTTT 240 

I I I I I i i I M I I I I I I I I I I I I I I I I I I M I M I I I I I I I I I I I j I M I I I I I I I I I M I 

Db 181 CGGGCACGCCGGTACACCCCAACGCCTCCGATTCCTGTCCCAAGCATCCTGCCTCCTTTT 240 

Qy 241 CCGGGACCTGGACTATTTACCCGGATGGCCGGTTTGGGAACCAGATGGGACAGTATGCCA 300 

I I I I I I I I i M t I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I t I I I I I 
Db 241 CCGGGACCTGGACTATTTACCCGGATGGCCGGTTTGGGAACCAGATGGGACAGTATGCCA 300 

Qy 301 CGCTGCTGGCCCTGGCGCAGCTCAACGGCCGCCAGGCCTTCATCCAGCCTGCCATGCACG 360 

I I I I I I I I I I M I I i I I I I I I I I I I I I I I M I I I I I I I I I M I I I I I t I I M I I I I I M I 

Db 301 CGCTGCTGGCCCTGGCGCAGCTCAACGGCCGCCAGGCCTTCATCCAGCCTGCCATGCACG 360 

Qy 361 CCGTCCTGGCCCCCGTGTTCCGCATCACGCTGCCTGTCCTGGCGCCCGAGGTAGACAGGC 420 

I N I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M M I I I I I I I I I I I I I I I I I I I I I I I I 
Db 361 CCGTCCTGGCCCCCGTGTTCCGCATCACGCTGCCTGTCCTGGCGCCCGAGGTAGACAGGC 420 

Qy 421 ACGCTCCTTGGCGGGAGCTGGAGCTTCACGACTGGATGTCCGAGGATTATGCCCACTTAA 480 

I I I I I I I M I I I I I I I i I I I M M M I I I i M I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 421 ACGCTCCTTGGCGGGAGCTGGAGCTTCACGACTGGATGTCCGAGGATTATGCCCACTTAA 480 

Qy 481 AGGAGCCCTGGCTGAAGCTCACCGGCTTCCCCTGCTCCTGGACCTTCTTCCACCACCTCC 540 

■IIMIIMIIIIIIIIIMI Mlllllll I I I I I I I I I I I I I M 

Db 481 AGGAGCCCTGGCTGAAGCTCACCGGCTTCCCCTGCTCCTGGACCTTCTTCCACCACCTCC 540 

Qy 541 GGGAGCAGATCCGCAGCGAGTTCACCCTGCACGACCACCTTCGGCAAGAGGCCCAGGGGG 600 

lllllllilllllllllMMIMMIIIMM I I I I I I I I I I I I I I I I I I I I 

Db 541 GGGAGCAGATCCGCAGCGAGTTCACCCTGCACGACCACCTTCGGCAAGAGGCCCAGGGGG 600 
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o^^v-^lVl_i ocoxv^u xvcauiid i^cuuia lui r\.ppiii^auuii v:7o^n-L,uo oiiu ocaicii xvcduit ua-vyiz-o-t^-z..., A'a^c u ui 



Qy 


601 


TACTGAGTCAGTTCCGTCTACCCCGCACAGGGGACCGCCCCAGCACCTTCGTGGGGGTCC 


660 






1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 




Db 


601 


TACTGAGTCAGTTCCGTCTACCCCGCACAGGGGACCGCCCCAGCACCTTCGTGGGGGTCC 


660 


Qy 


661 


ACGTGCGCCGCGGGGACTATCTGCGTGTGATGCCCAAGCGCTGGAAGGGGGTGGTGGGTG 


720 






1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 t 1 1 1 1 1 1 1 1 1 




Db 


661 


ACGTGCGCCGCGGGGACTATCTGCGTGTGATGCCCAAGCGCTGGAAGGGGGTGGTGGGTG 


720 


Qy 


721 


ACGGCCGTTACCTCCAGCAGGCTATGGACTGGTTCCGGGCCCGATACGAAGCCCCCGTCT 


780 






1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 i 1 




Db 


721 


ACGGCCGTTACCTCCAGCAGGCTATGGACTGGTTCCGGGCCCGATACGAAGCCCCCGTCT 


780 


Qy 


781 


TTGTGGTCACCAGCAACGGCATGGAGTGGTGCCGGAAGAACATCGACACCTCCCGGGGGG 


840 






1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 M 1 




Db 


781 


TTGTGGTCACCAGCAACGGCATGGAGTGGTGCCGGAAGJVACATCGACACCTCCCGGGGGG 


840 


Qy 


841 


ACGTGATCTTTGCTGGCGATGGGCGGGAGGCCGCGCCCGCCAGGGACTTTGCGCTGCTGG 


900 






M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M M 1 1 1 M M 1 1 1 1 1 1 1 1 1 1 1 




Db 


841 


ACGTGATCTTTGCTGGCGATGGGCGGGAGGCCGCGCCCGCCAGGGACTTTGCGCTGCTGG 


900 


Qy 


901 


TGCAGTGCAACCACACCATCATGACCATTGGCACCTTCGGCTTCTGGGCCGCCTACCTGG 


960 






1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 




Db 


901 


TGCAGTGCAACCACACCATCATGACCATTGGCACCTTCGGCTTCTGGGCCGCCTACCTGG 


960 


Qy 


961 


CTGGTGGAGATACCATCTACTTGGCTAACTTCACCCTGCCCACTTCCAGCTTCCTGAAGA 


1020 






1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 j 1 1 M 1 1 1 1 1 1 




Db 


961 


CTGGTGGAGATACCATCTACTTGGCTAACTTCACCCTGCCCACTTCCAGCTTCCTGAAGA 


1020 


Qy 


1021 


TCTTTAAACCCGAGGCTGCCTTCCTGCCCGAGTGGGTGGGCATTAATGCAGACTTGTCTC 


1080 






M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M t 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 




Db 


1021 


TCTTTAAACCCGAGGCTGCCTTCCTGCCCGAGTGGGTGGGCATTAATGCAGACTTGTCTC 


1080 


Qy 


1081 


CACTCCAGATGTTGGCTGGGCCTTGAACCAGCCAGGAGCCTTTCTGGAATAGCCTCGGTC 


1140 






1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 




Db 


1081 


CACTCCAGATGTTGGCTGGGCCTTGAACCAGCCAGGAGCCTTTCTGGAATAGCCTCGGTC 


1140 


Qy 


1141 


AACCCAGGGCCAGCGTTATGGGTCTCCGGAAGCCCGAGTAACTTCCGGAGATGCTGGTGG 


1200 






1 1 1 1 1 1 1 1 M 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 t 1 1 1 1 1 1 1 1 1 1 1 j 1 1 1 1 1 1 1 1 1 1 




Db 


1141 


AACCCAGGGCCAGCGTTATGGGTCTCCGGAAGCCCGAGTAACTTCCGGAGATGCTGGTGG 


1200 


Qy 


1201 


TCCTGTAGCAGGCTGGACACTTATTTCAAGAGTGATTCTAATTGGCTGGACTCAGAGGAA 


1260 






1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 I 1 I 1 1 1 1 1 1 1 I ■ 1 I I 1 1 1 1 1 1 111! 

1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 




Db 


1201 


TCCTGTAGCAGGCTGGACACTTATTTCAAGAGTGATTCTAATTGGCTGGACTCAGAGGAA 


1260 


Qy 


1261 


ACCCTGCAG 1269 




Db 


1261 


1 1 1 1 1 1 1 1 1 
ACCCTGCAG 12 69 





RESULT 3 

US-09-051-034A-3 

; Sequence 3, Application US/09051034A 
; Patent No. US20010055584A1 
; GENERAL INFORMATION: 

; APPLICANT: MCKENZIE, IAN FARQUHAR CAMPBELL 
; APPLICANT: SANDRIN, MAURO SERGIO 

; TITLE OF INVENTION: IMPROVED NUCLEIC ACIDS ENCODING A CHIMERIC 

; TITLE OF INVENTION: GLYCOSYLTRANSFERASE 

; FILE REFERENCE: 30562. 6USW0 

; CURRENT APPLICATION NUMBER: US/09/051 , 034A 

; CURRENT FILING DATE: 1998-03-31 
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; PRIOR APPLICATION NUMBER: PCT/AU97/004 92 

; PRIOR FILING DATE: 1997-08-01 

; PRIOR APPLICATION NUMBER: 60/024,279 

; PRIOR FILING DATE: 1996-08-21 

; PRIOR APPLICATION NUMBER: PO1402 

; PRIOR FILING DATE: 1996-08-02 

; NUMBER OF SEQ ID NOS : 14 

; SOFTWARE: Patentin Ver . 2.1 

; SEQ ID NO 3 

LENGTH: 1098 

TYPE: DNA 

ORGANISM: Sus Domesticus 
FEATURE: 
NAME/KEY: CDS 
LOCATION: (1) . . (1098) 
US-09-051-034A-3 

Query Match 86.3%; Score 1094.8; DB 3; Length 1098; 

Best Local Similarity 99.8%; Pred. No. 8.5e-313; 



Matches 


1096; Conservative 0; Mismatches 2; Indels 0; Gaps 


0 






ATGTGGGTCCCCAGCCGCCGCCACCTCTGTCTGACCTTCCTGCTAGTCTGTGTTTTAGCA 68 






1 1 1 1 1 1 1 1 M 1 M 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 M t 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 




Db 


1 


ATGTGGGTCCCCAGCCGCCGCCACCTCTGTCTGACCTTCCTGCTAGTCTGTGTTTTAGCA 


60 




Q 
D y 


GCAATTTTCTTCCTGAACGTCTATCAAGACCTCTTTTACAGTGGCTTAGACCTGCTGGCC 


128 






1 1 i 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 M 1 1 1 i 1 1 1 1 1 




Db 


61 


GCAATTTTCTTCCTGAACGTCTATCAAGACCTCTTTTACAGTGGCTTAGACCTGCTGGCC 


120 


Ov 


X ^ -7 


CTGTGTCCAGACCATAACGTGGTATCATCTCCCGTGGCCATATTCTGCCTGGCGGGCACG 


188 






1 1 1 1 1 1 1 t 1 1 t 1 1 1 1 1 1 1 t 1 t 1 1 1 ) 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 I 1 1 I 

1 1 E 1 1 1 1 M M 1 M M M 1 M 1 M M M M 1 1 1 1 M 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 M 




Db 


121 


CTGTGTCCAGACCATAACGTGGTATCATCTCCCGTGGCCATATTCTGCCTGGCGGGCACG 


180 


Qv 


189 


CCGGTACACCCCAACGCCTCCGATTCCTGTCCCAAGCATCCTGCCTCCTTTTCCGGGACC 


248 






1 1 1 1 1 M M 1 1 1 1 i 1 1 1 1 1 1 1 M 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 j 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 




Db 


181 


CCGGTACACCCCAACGCCTCCGATTCCTGTCCCAAGCATCCTGCCTCCTTTTCCGGGACC 


240 


Qy 


249 


TGGACTATTTACCCGGATGGCCGGTTTGGGAACCAGATGGGACAGTATGCCACGCTGCTG 


308 






1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 




Db 


241 


TGGACTATTTACCCGGATGGCCGGTTTGGGAACCAGATGGGACAGTATGCCACGCTGCTG 


300 


Qy 


309 


GCCCTGGCGCAGCTCAACGGCCGCCAGGCCTTCATCCAGCCTGCCATGCACGCCGTCCTG 


368 






1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 i 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 




Db 


301 


GCCCTGGCGCAGCTCAACGGCCGCCAGGCCTTCATCCAGCCTGCCATGCACGCCGTCCTG 


360 


Qy 


369 


GCCCCCGTGTTCCGCATCACGCTGCCTGTCCTGGCGCCCGAGGTAGACAGGCACGCTCCT 


428 






1 1 1 1 1 1 1 1 N 1 t 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 




Db 


361 


GCCCCCGTGTTCCGCATCACGCTGCCTGTCCTGGCGCCCGAGGTAGACAGGCACGCTCCT 


420 


Qy 


429 


TGGCGGGAGCTGGAGCTTCACGACTGGATGTCCGAGGATTATGCCCACTTAAAGGAGCCC 


488 






1 1 1 1 N M 1 1 1 M 1 1 M t 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 




Db 


421 


TGGCGGGAGCTGGAGCTTCACGACTGGATGTCCGAGGATTATGCCCACTTAAAGGAGCCC 


480 


Qy 


489 


TGGCTGAAGCTCACCGGCTTCCCCTGCTCCTGGACCTTCTTCCACCACCTCCGGGAGCAG 


548 






1 1 1 M IIII M 1 II 1 1 II 1 1 1 M II 1 II 1 1 1 1 II 1 1 1 1 1 1 




Db 


481 


TGGCTGAAGCTCACCGGCTTCCCCTGCTCCTGGACCTTCTTCCACCACCTCCGGGAGCAG 


540 


Qy 


549 


ATCCGCAGCGAGTTCACCCTGCACGACCACCTTCGGCAAGAGGCCCAGGGGGTACTGAGT 


608 






II Illlllllllllll 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 




Db 


541 


ATCCGCAGCGAGTTCACCCTGCACGACCACCTTCGGCAAGAGGCCCAGGGGGTACTGAGT 


600 
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ocoicii ivcauiia x^cuaiia lui rvppiicauuii u^o^^^uo oiiu ocan.ii xvc:>uii ua-w:7-on^-z,... r age o 



Qy 


609 


CAGTTCCGTCTACCCCGCACAGGGGACCGCCCCAGCACCTTCGTGGGGGTCCACGTGCGC 

1 N 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 M 1 1 M M 1 1 1 1 1 1 1 t 1 1 1 1 M M 1 1 1 1 1 M 1 1 1 1 1 

CAGTTCCGTCTACCCCGCACAGGGGACCGCCCCAGCACCTTCGTGGGGGTCCACGTGCGC 


668 


Db 


601 


660 


Qy 


669 


CGCGGGGACTATCTGCGTGTGATGCCCAAGCGCTGGAAGGGGGTGGTGGGTGACGGCCGT 

1 M 1 M 1 1 1 1 1 1 1 1 1 1 1 M M 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 M 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 | 

CGCGGGGACTATCXGCGTGTGATGCCCAAGCGCTGGAAGGGGGTGGTGGGTGACGGCGCT 


728 


Db 


661 


720 


Qy 


729 


TACCTCCAGCAGGCTATGGACTGGTTCCGGGCCCGATACGAAGCCCCCGTCTTTGTGGTC 

INIIMMIIIIIIIIIIIIIIIIIIIIIllMIIIIIMIIIIIIllliiiiiiiiii 

TACCTCCAGCAGGCTATGGACTGGTTCCGGGCCCGATACGAAGCCCCCGTCTTTGTGGTC 


788 


Db 


721 


780 


Qy 


789 


ACCAGCAACGGCATGGAGTGGTGCCGGAAGAACATCGACACCTCCCGGGGGGACGTGATC 

> 1 1 M 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 

ACCAGCAACGGCATGGAGTGGTGCCGGAAGAACATCGACACCTCCCGGGGGGACGTGATC 


848 


Db 


781 


840 


Qy 


849 


TTTGCTGGCGATGGGCGGGAGGCCGCGCCCGCCAGGGACTTTGCGCTGCTGGTGCAGTGC 

1 N M 1 1 M 1 1 M I M 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

TTTGCTGGCGATGGGCGGGAGGCCGCGCCCGCCAGGGACTTTGCGCTGCTGGTGCAGTGC 


908 


Db 


841 


900 


Qy 


909 


AACCACACCATCATGACCATTGGCACCTTCGGCTTCTGGGCCGCCTACCTGGCTGGTGGA 

II IIIIIIIIIIIIIIIIIIII 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 I 1 1 I 1 1 

AACCACACCATCATGACCATTGGCACCTTCGGCTTCTGGGCCGCCTACCTGGCTGGTGGA 


968 


Db 


901 


960 


Qy 


969 


GATACCATCTACTTGGCTAACTTCACCCTGCCCACTTCCAGCTTCCTGAAGATCTTTAAA 
N 1 1 1 1 1 1 1 1 1 1 1 i 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 I t 1 1 1 1 1 1 1 1 i 1 1 1 1 1 1 1 1 1 1 1 1 t 1 M 1 1 1 
GATACCATCTACTTGGCTAACTTCACCCTGCCCACTTCCAGCTTCCTGAAGATCTTTAAA 


1028 


Db 


961 


1020 


Qy 


1029 


CCCGAGGCTGCCTTCCTGCCCGAGTGGGTGGGCATTAATGCAGACTTGTCTCCACTCCAG 

N llllillllilMlltllMIMIIIIIIIMIIIIIMIIIIIMIIl 

CCCGAGGCTGCCTTCCTGCCCGAGTGGGTGGGCATTAATGCAGACTTGTCTCCACTCCAG 


1088 


Db 


1021 


1080 


Qy 


1089 


ATGTTGGCTGGGCCTTGA 1106 
1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 
ATGTTGGCTGGGCCTTGA 1098 




Db 


1081 





RESULT 4 

US-09-863-475A-5 

; Sequence 5, Application US/09863475A 
; Patent No. US20020102688A1 
GENERAL INFORMATION: 

APPLICANT: LOWE, JOHN 

TITLE OF INVENTION: METHODS AND PRODUCTS FOR THE SYNTHESIS 

OF OLIGOSACCHARIDE STRUCTURES ON GLYCOPROTEINS, 
GLYCOLIPIDS, OR AS FREE MOLECULES, AND FOR THE ISOLATION 
OF CLONED GENETIC SEQUENCES THAT DETERMINE THESE STRUCTUR 
NUMBER OF SEQUENCES: 14 
CORRESPONDENCE ADDRESS: 

ADDRESSEE: OBLON, SPIVAK, McCLELLAND, MAIER & NEUSTADT, 
P.C. 

STREET: 1755 Jefferson Davis Highway, Fourth Floor 
CITY: Arlington 
STATE: Virginia 
COUNTRY: U.S.A. 
ZIP: 22202 
COMPUTER READABLE FORM: 

MEDIUM TYPE: Floppy disk 

COMPUTER: IBM PC compatible 

OPERATING SYSTEM: PC-DOS /MS -DOS 

SOFTWARE: PatentIn Release #1.0, Version #1.25 
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CURRENT APPLICATION DATA: 

APPLICATION NUMBER: US/0 9/8 63 , 47 5A 
FILING DATE: 24-May-2001 
CLASSIFICATION: 
PRIOR APPLICATION DATA: 

APPLICATION NUMBER: 07/914,281 
FILING DATE: 20-JUL-1992 
ATTORNEY /AGENT INFORMATION: 

NAME: Lavalleye, Jean- Paul M. P. 
REGISTRATION NUMBER: 31,451 
REFERENCE/DOCKET NUMBER: 2363-060-55 
TELECOMMUNICATION INFORMATION: 
TELEPHONE: (703)521-4500 
TELEFAX: (703)486-2347 
TELEX: 248855 OPAT UR 
INFORMATION FOR SEQ ID NO: 5: 
SEQUENCE CHARACTERISTICS: 

LENGTH: 8174 base pairs 
TYPE: nucleic acid 
STRANDEDNESS : unknown 
TOPOLOGY: unknown 
MOLECULE TYPE: DNA (genomic) 
ANTI -SENSE: NO 

SEQUENCE DESCRIPTION: SEQ ID NO: 5: 
US-09-863-475A-5 

Query Match 63.7%; Score 808.6; DB 3; Length 8174; 

Best Local Similarity 79.6%; Pred. No. 5.4e-228; 

Matches 993; Conservative 0; Mismatches 249; Indels 5; Gaps 3; 

CTCGAGCCATGTGGGTCCCCAGCCGCCGCCACCTCTGTCTGACCTTCCTGCTAGTCTGTG 60 

M I I I I I I I I I I I I t I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I M I 

CTGCAGCCATGTGGCTCCGGAGCCATCGTCAGCTCTGCCTGGCCTTCCTGCTAGTCTGTG 4737 

TTTTAGCAGCAATTTTCTTCCTGAACGTCTATCAAGACCTCTTTTACAGTGGCTTAGACC 120 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 



II I I I I I II I I I I II I I I I I II I I I II II I II I I I II II I I II 



I I I I I I I I I I I I II II I I I I II I I I I I I I I I I I I I I II I II I I I I I 



I I I I I I I M I M I I II I I i I II I I I I I I I I I I II I I I I I I I I I I II I I I I II I 



I I i I I I I I I I Mill II I I I I I I I I I I t I I I I II I I I I I I I I I II I I I I I M I I 



IN I I I I II I I I M I I I I I I I M M I I I I I II Mill II II II I II I I I 



I I II II M M II II I I I II I M I I II II I I I I I I I I II II II II I I I I I 



Qy 


1 


Db 


4678 


Qy 


61 


Db 


4738 


Qy 


121 


Db 


4798 


Qy 


181 


Db 


4858 


Qy 


241 


Db 


4918 


Qy 


301 


Db 


4978 


Qy 


361 


Db 


5038 


Qy 


421 


Db 


5098 
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►J^^ANJ-^ occuL/11 xvc;auiid j^c^ioiia lui r\yyii^aii\jii v:7o^n^uo oiiu ocaii^ii xvcduii ud-v:7-on^... x'a^c iv ui 



yy 


*1 O-L 


UD 


1 CO 


yy 


DHJ. 


uD 


O 1 Q 


yy 


oU J. 


UD 


. C O T O 


Qy 


bbl 


DD 


c o o o 


Qy 


T O 1 
f Z± 


DD 


5398 


Qy 


T O 1 

/ol 


DD 


5458 


Qy 




Db 


5518 


Qy 


y Ul 


Db 


5578 


Qy 


y bi 


Db 


5638 


Qy 


1021 


DD 


o598 


Qy 


lUol 


DD 


5758 


Qy 


1141 


Db 


5816 


Qy 


1201 


Db 


5875 



AGGAGCCCTGGCTGAAGCTCACCGGCTTCCCCTGCTCCTGGACCTTCTTCCACCACCTCC 540 

II II I I I I I I I I II I II I II I I I I I I I I I Mill I I I I I II M II I I I I 
GAGATCCTTTCCTGAAGCTCTCTGGCTTCCCCTGCTCTTGGACTTTCTTCCACCATCTCC 5217 



I I I I I II II II I I I I II M II II I I I II II I II I II I I I I I I I I I II M Ml I I 



TACTGAGTCAGTTCCGTCTACCCCGCACAGGGGACCGCCCCAGCACCTTCGTGGGGGTCC 660 

I Ml Mill MM II I II I II I I I II II M M I M II M I II II I II I 

TGCTGGGTCAGCTCCGCCTGGGCCGCACAGGGGACCGCCCGCGCACCTTTGTCGGCGTCC 5337 

ACGTGCGCCGCGGGGACTATCTGCGTGTGATGCCCAAG.CGCTGGAAGGGGGTGGTGGGTG 720 

I I M II II II I M II II II II I I II Mill II M I II I II II I II I I M II I 
ACGTGCGCCGTGGGGACTATCTGCAGGTTATGCCTCAGCGCTGGAAGGGTGTGGTGGGCG 5397 

ACGGCCGTTACCTCCAGCAGGCTATGGACTGGTTCCGGGCCCGATACGAAGCCCCCGTCT 780 

II II I II II II II II I I M II II I II I M I II II 11' II II II II I II I I I 
ACAGCGCCTACCTCCGGCAGGCCATGGACTGGTTCCGGGCACGGCACGAAGCCCCCGTTT 5457 

TTGTGGTCACCAGCAACGGCATGGAGTGGTGCCGGAAGAACATCGACACCTCCCGGGGGG 840 
I M II It I II I II I I I II II I II I II II I I I II I I II M I I II II I I III I 

TCGTGGTCACCAGCAACGGCATGGAGTGGTGTAAAGAAAACATCGACACCTCCCAGGGCG 5517 

ACGTGATCTTTGCTGGCGATGGGCGGGAGGCCGCGCCCGCCAGGGACTTTGCGCTGCTGG 900 
I II II I II I II I I I II II I I I II I I I III I I II II I II I II I I 

ATGTGACGTTTGCTGGCGATGGACAGGAGGCTACACCGTGGAAAGACTTTGCCCTGCTCA 5577 

TGCAGTGCAACCACACCATCATGACCATTGGCACCTTCGGCTTCTGGGCCGCCTACCTGG 9 60 

I I I I I II I M I II II I I II II II I II II II II I II II II II II II I I II II II II I 
CACAGTGCAACCACACCATTATGACCATTGGCACCTTCGGCTTCTGGGCTGCCTACCTGG 5637 

CTGGTGGAGATACCATCTACTTGGCTAACTTCACCCTGCCCACTTCCAGCTTCCTGAAGA 1020 
I I I I Mill II II II I II II II II II II II II II M I I II I I M I I 

CTGGCGGAGACACTGTCTACCTGGCCAACTTCACCCTGCCAGACTCTGAGTTCCTGAAGA 5697 

TCTTTAAACCCGAGGCTGCCTTCCTGCCCGAGTGGGTGGGCATTAATGCAGACTTGTCTC 1080 
I I I I II I II II II I It II I II II II II II II II II II II II II II II I I I II I M II 
TCTTTAAGCCGGAGGCGGCCTTCCTGCCCGAGTGGGTGGGCATTAATGCAGACTTGTCTC 5757 

CACTCCAGATGTTGGCTGGGCCTTGAACCAGCCAGGAGCCTTTCTGGAATAGCCTCGGTC 1140 
Mill II I M I I I M II It I It II It I II I I I I III 

CACTCTGGACATTGGCTAAGCCTTGA—GAGCCAGGGAGACTTTCTGAAGTAGCCTGATC 5815 

AACCCAGGGCCAGCGTTATGGGTCTCCGGAAGCCCGAGTAACTTCCGGAGATGCTGGTGG 1200 

I I I I I I I M It t I I I I II 11 I I 1 I It 1 I II M I I II II I I 
TTTCTAGAGCCAGCAGTACGTGGCTTCAGAGGCCTG-GCATCTTCTGGAGAAGCTTGTGG 5874 

TCCTGTAGCAG — GCTGGACACTTATTTCAAGAGTGATTCTAATTGG 124 5 
I I III I II I I I I II II It I II II I I I II 

TGTTCCTGAAGCAAATGGGTGCCCGTATCCAGAGTGATTCTAGTTGG 5921 



RESULT 5 
US-10-105-963-9 

; Sequence 9, Application US/10105963 

; Publication No. US20030068818A1 

; GENERAL INFORMATION: 

; APPLICANT: Geron Corporation 

; APPLICANT: Denning, Chris 
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APPLICANT: Clark, A. John 
APPLICANT: Schiff, J. Michael 

TITLE OF INVENTION: Animal Tissue with Carbohydrate Antigens Compatible for Human 
TITLE OF INVENTION: Transplantation and a Carbohydrate Determinant Selection Syste 
TITLE OF INVENTION: Recombination 
FILE REFERENCE: 731/002 

CURRENT APPLICATION NUMBER: US/10/105,963 
CURRENT FILING DATE: 2002-03-21 
PRIOR APPLICATION NUMBER: US 60/277,811 
PRIOR FILING DATE: 2001-03-21 
NUMBER OF SEQ ID NOS : 40 
SOFTWARE: Patentin version 3.1 
SEQ ID NO 9 

LENGTH: 3373 
' TYPE: DNA 

ORGANISM: Homo sapiens 
FEATURE : 
NAME/KEY: CDS 
LOCATION: (104) . . (1201) 
OTHER INFORMATION: 
US-10-105-963-9 

Query Match 63.6%; Score 806.8; DB. 6; Length 3373; 

Best Local Similarity 79.7%; Pred. No. 1.5e-227; 

Matches 990; Conservative 0; Mismatches 247; Indels 5; Gaps 3; 



Qy 


6 


GCCATGTGGGTCCCCAGCCGCCGCCACCTCTGTCTGACCTTCCTGCTAGTCTGTGTTTTA 

1 1 1 1 1 t 1 1 1 III Mil II II 1 1 1 1 1 III 1 1 1 1 1 1 1 1 • 

>>>'>'■■> > > 1 INI M M 1 M M IN 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 
GCCATGTGGCTCCGGAGCCATCGTCAGCTCTGCCTGGCCTTCCTGCTAGTCTGTGTCCTC 


65 


Db 


101 


160 


Qy 


66 


GCAGCAATTTTCTTCCTGAACGTCTATCAAGACCTCTTTTACAGTGGCTTAGACCTGCTG 

1 i 1 1 I 1 1 1 1 1 1 M 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 

TCTGTAATCTTCTTCCTCCATATCCATCAAGACAGCTTTCCACATGGCCTAGGCCTGTCG 


125 


Db 


161 


220 


Qy 


126 


GCCCTGTGTCCAGACCATAACGTGGTATCATCTCCCGTGGCCATATTCTGCCTGGCGGGC 

INIMM 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 

ATCCTGTGTCCAGACCGCCGCCTGGTGACACCCCCAGTGGCCATCTTCTGCCTGCCGGGT 


185 


Db 


221 


280 


Qy 


186 


ACGCCGGTACACCCCAACGCCTCCGATTCCTGTCCCAAGCATCCTGCCTCCTTTTCCGGG 
N III II 1 t 1 1 1 1 1 1 1 1 1 1 1 1 1 i 1 1 t 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 
ACTGCGATGGGCCCCAACGCCTCCTCTTCCTGTCCCCAGCACCCTGCTTCCCTCTCCGGC 


245 


Db 


281 


340 


Qy 


246 


ACCTGGACTATTTACCCGGATGGCCGGTTTGGGAACCAGATGGGACAGTATGCCACGCTG 
IIIIIIIM 1 Mill lllllllllllll II 1 II 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 
ACCTGGACTGTCTACCCCAATGGCCGGTTTGGTAATCAGATGGGACAGTATGCCACGCTG 


305 


Db 


341 


400 


Qy 


306 


CTGGCCCTGGCGCAGCTCAACGGCCGCCAGGCCTTCATCCAGCCTGCCATGCACGCCGTC 

IIMI llllllll Ill llllll MM llllllllllll 

CTGGCTCTGGCCCAGCTCAACGGCCGCCGGGCCTTTATCCTGCCTGCCATGCATGCCGCC 


365 


Db 


401 


460 


Qy 


366 


CTGGCCCCCGTGTTCCGCATCACGCTGCCTGTCCTGGCGCCCGAGGTAGACAGGCACGCT 

IIIMMI M III Ml Mill II Mill II II M Mill 1 1 1 

CTGGCCCCGGTATTCCGCATCACCCTGCCCGTGCTGGCCCCAGAAGTGGACAGCCGCACG 


425 


Db 


461 


520 


Qy 


426 


CCTTGGCGGGAGCTGGAGCTTCACGACTGGATGTCCGAGGATTATGCCCACTTAAAGGAG 

II 1 M M II 1 1 1 1 1 II 1 1 II II II II 1 II 1 1 1 1 1 II 1 1 M II II II 1 M 

CCGTGGCGGGAGCTGCAGCTTCACGACTGGATGTCGGAGGAGTACGCGGACTTGAGAGAT 


485 


Db 


521 


580 


Qy 


486 


CCCTGGCTGAAGCTCACCGGCTTCCCCTGCTCCTGGACCTTCTTCCACCACCTCCGGGAG 

II 1 1 M M 1 1 1 1 1 II II 1 1 M II 1 1 1 1 II 1 1 1 1 1 1 1 M M 1 M llllllll 

CCTTTCCTGAAGCTCTCTGGCTTCCCCTGCTCTTGGACTTTCTTCCACCATCTCCGGGAA 


545 


Db 


581 


640 
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K>^\ji\xj ij^ai^n ivcduiia j^cioiid lui rvp^m^auuii \jyon^z,Kj<^ oiiu ocai\^ii xvcauii u:>-u:7-o-m 



Qy 


546 


CAGATCCGCAGCGAGTTCACCCTGCACGACCACCTTCGGCAAGAGGCCCAGGGGGTACTG 

IIIIMIMM lllltllMIIIIIIIIIIIIIIIlM illllll III 1 II III 

CAGATCCGCAGAGAGTTCACCCTGCACGACCACCTTCGGGAAGAGGCGCAGAGTGTGCTG 


n R 

DUO 


Db 


641 


/ u u 


Qy 


606 


AGTCAGTTCCGTCTACCCCGCACAGGGGACCGCCCCAGCACCTTCGTGGGGGTCCACGTG 

1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II II 1 II 1 1 M II 

GGTCAGCTCCGCCTGGGCCGCACAGGGGACCGCCCGCGCACCTTTGTCGGCGTCCACGTG 


D D O 


Db 


701 


/ \J\J 


Qy 


666 


CGCCGCGGGGACTATCTGCGTGTGATGCCCAAGCGCTGGAAGGGGGTGGTGGGTGACGGC 
Mill 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 II 1 II 1 1 II M 1 1 1 1 M 1 1 Ml II 
CGCCGTGGGGACTATCTGCAGGTTATGCCTCAGCGCTGGAAGGGTGTRaTaf^arrararr 


f 


■Db 


761 


p 9 n 


Qy 


726 


CGTTACCTCCAGCAGGCTATGGACTGGTTCCGGGCCCGATACGAAGCCCCCGTCTTTGTG 
illllll 1 M 1 1 1 1 II 1 1 1 1 1 1 1 1 II 1 1 II II 1 II 1 1 1 1 II M 1 1 M III 
GCCTACCTCCGGCAGGCCATGGACTGGTTCCGGGCACGGCACGAAGCCCrrr^TTTTraTr; 


/ 0 o 


Db 


821 


ft ft n 

O 0 u 


Qy 


786 


GTCACCAGCAACGGCATGGAGTGGTGCCGGAAGAACATCGACACCTCCCGGGGGGACGTG 
1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 II 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 II II II III II III 
GTC ACCAGC AAC GGC ATGGAGTGG T GTAAAGAAAAC ATC G AC AC r Tr r r fl f:;af::r r a T r T r 


ft 4 c: 
OHO 


Db 


881 




Qy 


846 


ATCTTTGCTGGCGATGGGCGGGAGGCCGCGCCCGCCAGGGACTTTGCGCTGCTGGTGCAG 
1 i 1 i 1 1 II 1 1 1 1 II 1 1 1 1 II II III 

AC GT T TGC TGGCGAT GGAC AGGAGGC TAG AC C GTGGAAAG AP T T Tar r r Tar TP ap n rn r 


905 


Db 


941 


XUUU 


Qy 

Db 


906 
1001 


TGCAACCACACCATCATGACCATTGGCACCTTCGGCTTCTGGGCCGCCTACCTGGCTGGT 
1 N 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 II 1 1 II 1 1 1 1 1 1 1 1 1 II II 1 1 II II 1 1 

TGC AACC AC ACC AT T ATGAC C AT TGGC ACC T T C GGC T TP T aac^r Tar ri^Tmr^rrr'^rnr' 


Q £ R 

1 n c A 
iUbU 


Qy 


966 


GGAGATACCATCTACTTGGCTAACTTCACCCTGCCCACTTCCAGCTTCCTGAAGATCTTT 
Illllll 1 II 1 1 1 1 1 1 1 1 1 t 1 II 1 1 1 M 1 1 II 1 II 1 II 1 1 1 t 1 II II 
GGAGAC AC TGTC T AC C TGGC CAAC T T C ACC C T GC C AGAC TCTGAaTTPPTaanpnTrTTT 


1025 


Db 


1061 




Qy 

Db 


1026 
1121 


AAACCCGAGGCTGCCTTCCTGCCCGAGTGGGTGGGCATTAATGCAGACTTGTCTCCACTC 

II 1 > 1 1 1 1 1 li 1 M 1 M 1 1 II 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 11 1 1 II 1 1 1 1 

AAGCCGGAGGCGGCCTTCCTGCCCGAGTGGGTGGGCATTAATC;PAr;APTTr:TPTrr2irTr 


1 Oft 
J. uo o 

lib U 


Qy 


1086 


CAGATGTTGGCTGGGCCTTGAACCAGCCAGGAGCCTTTCTGGAATAGCCTCGGTCAACCC 

t 1 1 1 1 1 1 1 Illllll Illllll II III 1 III 1 
TGGACAT TGGC TAAGCC T T GA — GAGC C AGGGAGAC T T TC TGAAGT AGP PTRATPTTTpt 


llfl 0 


Db 


1181 


1 9 "3 Q 
IZ OO 


Qy 


1146 


AGGGCCAGCGTTATGGGTCTCCGGAAGCCCGAGTAACTTCCGGAGATGCTGGTGGTCCTG 
1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 
AGAGCCAGCAGTACGTGGCTTCAGAGGCCTG-GCATCTTCTGGAGAAGCTTGTGGTGTTC 


1205 


Db 


1239 


1297 


Qy 


1206 


TAGCAG — GCTGGACACTTATTTCAAGAGTGATTCTAATTGG 124 5 

III Ml 1 1 1 1 i II 1 1 1 1 1 1 1 II 1 1 1 1 
CTGAAGCAAATGGGTGCCCGTATCCAGAGTGATTCTAGTTGG 1339 




Db 


1298 





RESULT 6 
US-11-219-419-9 

; Sequence 9, Application US/11219419 

; Publication No. US20060057719A1 

; GENERAL INFORMATION: 

; APPLICANT: Geron Corporation 

; APPLICANT: Denning, Chris 

; APPLICANT: Clark, A. John 

; APPLICANT: Schiff, J. Michael 

; TITLE OF INVENTION: CARBOHYDRATE DETERMINANT SELECTION 
; FILE REFERENCE: 139/003d 
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; CURRENT APPLICATION NUMBER: US/11/219,419 

; CURRENT FILING DATE: 2005-09-02 

; PRIOR APPLICATION NUMBER: US 60/277,811 

; PRIOR FILING DATE: 2001-03-21 

; PRIOR APPLICATION NUMBER: US 10/105,963 

; PRIOR FILING DATE: 2002-03-21 

; NUMBER OF SEQ ID NOS : 40 

; SOFTWARE: Patentin version 3.3 

; SEQ ID NO 9 

LENGTH: 3373 

TYPE: DNA 

ORGANISM: Homo sapiens 
FEATURE : 
NAME/KEY: CDS 
LOCATION: ( 104 )..( 1201 ) 
US-11-219-419-9 



Query Match 63.6%; Score 806.8; DB 16; Length 3373; 

Best Local Similarity 79.7%; Pred. No. 1.5e-227; 

Matches 990; Conservative 0; Mismatches 247; Indels 5; Gaps 3; 



Ov 


D 


GCCATGTGGGTCCCCAGCCGCCGCCACCTCTGTCTGACCTTCCTGCTAGTCTGTGTTTTA 

1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 

(j*^ u/\ i 1 (j^jU 1 U U Ijtj AtirU A 1 U i O A(jC T C T G C C T G GC C 


65 


Db 




160 


Ov 


66 


GCAGCAATTTTCTTCCTGAACGTCTATCAAGACCTCTTTTACAGTGGCTTAGACCTGCTG 
1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 
1^ xvji/^iL, 1 1 ^ X i^^-'i^-v^AlAJ.UUAiUAACjAUAGL. xTTCCACATGGCCTAGGCCTGTCG 


125 


Db 


161 


220 


Ov 


126 


GCCCTGTGTCCAGACCATAACGTGGTATCATCTCCCGTGGCCATATTCTGCCTGGCGGGC 

M 1 1 1 1 1 1 1 1 1 1 M Mill II 1 II 1 1 1 II II 1 1 1 1 II II II 1 II 1 

-L \j J. vj J. \^^j-ivjrt^v^o\>v^ijv.^v^ xoijivj/\^^/\^o^^^/\ j_ i c KjCCTGCCGGGT 


185 


Db 


221 


280 


Qy 


186 


ACGCCGGTACACCCCAACGCCTCCGATTCCTGTCCCAAGCATCCTGCCTCCTTTTCCGGG 
II III 11 M 1 1 1 1 1 1 1 1 1 1 M 1 1 II 1 II 1 1 1 1 1 1 II 1 II 1 1 II M 1 
ACTGCGATGGGCCCCAACGCCTCCTCTTCCTGTCCCCAGCACCCTGCTTCCCTCTCCGGC 


245 


Db 


281 


340 


Qy 


246 


ACCTGGACTATTTACCCGGATGGCCGGTTTGGGAACCAGATGGGACAGTATGCCACGCTG 
1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 I 1 1 II 1 M 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 M 1 
ACCTGGACTGTCTACCCCAATGGCCGGTTTGGTAATCAGATGGGACAGTATGCCACGCTG 


305 


Db 


341 


400 


Qy 

Db 


306 
401 


CTGGCCCTGGCGCAGCTCAACGGCCGCCAGGCCTTCATCCAGCCTGCCATGCACGCCGTC 

1 1 1 N 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II II 1 

CTGGCTCTGGCCCAGCTCAACGGCCGCCGGGCCTTTATCCTGCCTGCCATGCATGCCGCC 


365 
460 


Qy 


366 


CTGGCCCCCGTGTTCCGCATCACGCTGCCTGTCCTGGCGCCCGAGGTAGACAGGCACGCT 
1 1 1 1 1 1 II II 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 II 1 1 1 1 1 II II II Mill 1 1 1 
CTGGCCCCGGTATTCCGCATCACCCTGCCCGTGCTGGCCCCAGAAGTGGACAGCCGCACG 


425 


Db 


461 


520 


Qy 


426 


CCTTGGCGGGAGCTGGAGCTTCACGACTGGATGTCCGAGGATTATGCCCACTTAAAGGAG 

UN i II 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

CCGTGGCGGGAGCTGCAGCTTCACGACTGGATGTCGGAGGAGTACGCGGACTTGAGAGAT 


485 


Db 


521 


580 


Qy 


486 


CCCTGGCTGAAGCTCACCGGCTTCCCCTGCTCCTGGACCTTCTTCCACCACCTCCGGGAG 
II 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 
CCTTTCCTGAAGCTCTCTGGCTTCCCCTGCTCTTGGACTTTCTTCCACCATCTCCGGGAA 


545 


Db 


581 


640 


Qy 


546 


CAGATCCGCAGCGAGTTCACCCTGCACGACCACCTTCGGCAAGAGGCCCAGGGGGTACTG 
illllllllll lllllllllllllllllllllllllll lllllll III 1 II III 
CAGATCCGCAGAGAGTTCACCCTGCACGACCACCTTCGGGAAGAGGCGCAGAGTGTGCTG 


605 


Db 


641 


700 
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"-sv^vyxvu* oc;ait/ii xvcduiid x^CLoiid lui r%.yyii^aii\Jii u3^o*tn^uo oiiu ocai^ii xvcduii ua-wj^-o-rn.. 



r a^c i-T yjx 



Qy 606 AGTCAGTTCCGTCTACCCCGCACAGGGGACCGCCCCAGCACCTTCGTGGGGGTCCACGTG 665 

I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I II II II I I I I I I I 
Db 701 GGTCAGCTCCGCCTGGGCCGCACAGGGGACCGCCCGCGCACCTTTGTCGGCGTCCACGTG 760 

Qy 666 CGCCGCGGGGACTATCTGCGTGTGATGCCCAAGCGCTGGAAGGGGGTGGTGGGTGACGGC 725 

II I II I I I I M I I I I I M II I II I I I II M I II M I I I I I I I I I I I III II 

Db 761 CGCCGTGGGGACTATCTGCAGGTTATGCCTCAGCGCTGGAAGGGTGTGGTGGGCGACAGC 820 

Qy 726 CGTTACCTCCAGCAGGCTATGGACTGGTTCCGGGCCCGATACGAAGCCCCCGTCTTTGTG 785 

I I I I I II I I I I I I II I I I I II M I I II I M II I I M I I I I I I I M II III 

Db 821 GCCTACCTCCGGCAGGCCATGGACTGGTTCCGGGCACGGCACGAAGCCCCCGTTTTCGTG 880 

Qy 78 6 GTCACCAGCAACGGCATGGAGTGGTGCCGGAAGAACATCGACACCTCCCGGGGGGACGTG 845 

I I I I II I I I I I I M I I I I I I I I I I I I I II M I M I I I I I I I I I III II III 

Db 881 GTCACCAGCAACGGCATGGAGTGGTGTAAAGAAAACATCGACACCTCCCAGGGCGATGTG 940 

Qy 846 ATCTTTGCTGGCGATGGGCGGGAGGCCGCGCCCGCCAGGGACTTTGCGCTGCTGGTGCAG 905 

I I I I 1 I I I I I I I M I I II I I II III I I I I I I I II I I I I I III 

Db 941 ACGTTTGCTGGCGATGGACAGGAGGCTACACCGTGGAAAGACTTTGCCCTGCTCACACAG 1000 

Qy 906 TGCAACCACACCATCATGACCATTGGCACCTTCGGCTTCTGGGCCGCCTACCTGGCTGGT 965 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I II I I I II I II I II I I I 
Db 1001 TGCAACCACACCATTATGACCATTGGCACCTTCGGCTTCTGGGCTGCCTACCTGGCTGGC 1060 

Qy 966 GGAGATACCATCTACTTGGCTAACTTCACCCTGCCCACTTCCAGCTTCCTGAAGATCTTT 1025 

I I I II I I I I I I I I II I I I I I I I I I I I I I I I II M II M II I I I I I I I 

Db 1061 GGAGACACTGTCTACCTGGCCAACTTCACCCTGCCAGACTCTGAGTTCCTGAAGATCTTT 1120 

Qy 1026 AAACCCGAGGCTGCCTTCCTGCCCGAGTGGGTGGGCATTAATGCAGACTTGTCTCCACTC 1085 

II II I I I M I I I I I I I I I I I I I I I I I I I I I I I I I M I I I M I II I II 1 II I I I I II I 

Db 1121 AAGCCGGAGGCGGCCTTCCTGCCCGAGTGGGTGGGCATTAATGCAGACTTGTCTCCACTC 1180 

Qy 1086 CAGATGTTGGCTGGGCCTTGAACCAGCCAGGAGCCTTTCTGGAATAGCCTCGGTCAACCC 1145 

II I I I I I I I I I I I I I I I I I I I I II IN I III I 

Db 1181 TGGACATTGGCTAAGCCTTGA — GAGCCAGGGAGACTTTCTGAAGTAGCCTGATCTTTCT 1238 

Qy 114 6 AGGGCCAGCGTTATGGGTCTCCGGAAGCCCGAGTAACTTCCGGAGATGCTGGTGGTCCTG 1205 

I I II I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I 
Db 1239 AGAGCCAGCAGTACGTGGCTTCAGAGGCCTG-GCATCTTCTGGAGAAGCTTGTGGTGTTC 1297 

Qy 1206 TAGCAG — GCTGGACACTTATTTCAAGAGTGATTCTAATTGG 1245 

III III I I M I I I I 1. 1 I I I I I I I I M 

Db 1298 CTGAAGCAAATGGGTGCCCGTATCCAGAGTGATTCTAGTTGG 1339 



RESULT 7 

US-09-051-034A-1 

; Sequence 1, implication US/09051034A 

; Patent No. US20010055584A1 

; GENERAL INFORMATION: 

; APPLICANT: MCKENZIE, IAN FARQUHAR CAMPBELL 
; APPLICANT: SANDRIN, MAURO SERGIO 

; TITLE OF INVENTION: IMPROVED NUCLEIC ACIDS ENCODING A CHIMERIC 

; TITLE OF INVENTION: GLYCOSYLTRANSFERASE 

; FILE REFERENCE : 30562 . 6USW0 

; CURRENT APPLICATION NUMBER: US/09/051, 034A 

; CURRENT FILING DATE: 1998-03-31 

; PRIOR APPLICATION NUMBER: PCT/AU97/004 92 

; PRIOR FILING DATE: 1997-08-01 

; PRIOR APPLICATION NUMBER: 60/024,279 

; PRIOR FILING DATE: 1996-08-21 
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o^v^ivi:. ocaii.li xvcauiia i^ctaiia lui r^^yii^au^u u^o^n^uo oiiu ocaiwi Avcauii ua-v^-o-r^... r a^c x j ux z.>r 

PRIOR APPLICATION NUMBER: PO1402 
PRIOR FILING DATE: 1996-08-02 
NUMBER OF SEQ ID NOS : 14 
SOFTWARE: PatentlnVer. 2.1 
SEQ ID NO 1 
LENGTH: 1043 
TYPE: DNA 

ORGANISM: Sus Domesticus 
FEATURE : 
NAME/KEY: CDS 
LOCATION: (9) . . (1031) 
US-09-051-034A-1 

Query Match 35.8%; Score 454.2; DB 3; Length 1043; 

Best Local Similarity 71.8%; Pred. No. 2e-123; 

Matches 610; Conservative 0; Mismatches 233; Indels 6; Gaps 1; 
Qy 243 GGGACCTGGACTATTTACCCGGATGGCCGGTTTGGGAACCAGATGGGACAGTATGCCACG 302 

>> I MTII Ml I llllllllllllll MM IIIM 

Db 183 GGCATGTGGACGATCAATGCCATCGGCCGCCTGGGGAACCAGATGGGGGAGTACGCCACC 242 

Qy 303 CTGCTGGCCCTGGCGCAGCTCAACGGCCGCCAGGCCTTCATCCAGCCTGCCATGCACGCC 362 

III MM Mill II I MIMMIIII II Ml j 

Db 243 CTGTACGCGCTGGCCAGGATGAACGGGCGGCCGGCCTTCATCCCGCCCGAGATGCACAGC 302 

Qy 363 GTCCTGGCCCCCGTGTTCCGCATCACGCTGCCTGTCCTGGCGCCCGAGGTAGACAGGCAC 422 

I N M I II II I II M I I M II II II III 

Db 303 ACGCTGGCCCCCATCTTCAGGATCACCCTCCCGGTCCTGCACGCCAGCACGGCCCGCAGG 362 

Qy 423 GCTCCTTGGCGGGAGCTGGAGCTTCACGACTGGATGTCCGAGGATTATGCCCACTTAAAG 482 

I I I I I I I I III II I I I I I II II Mi M I I I I I I 

Db 363 ATCCCCTGGCAGAACTACCACCTGAACGACTGGATGGAGGAGCGGTACCGCCACATCCCG 422 

Qy 483 GAGCCCTGGCTGAAGCTCACCGGCTTCCCCTGCTCCTGGACCTTCTTCCACCACCTCCGG 542 

I I I II Mill MM I I I I II II II II M II II M II II II II I II 

Db 423 GGGGAGTACGTGCGCCTCACGGGCTACCCCTGCTCCTGGACCTTCTACCACCACCTGCGC 482 

Qy 543 GAGCAGATCCGCAGCGAGTTCACCCTGCACGACCACCTTCGGCAAGAGGCCCAGGGGGTA 602 

IMMI I I llllll Ml Ml IIIIMM I 

Db 483 ACCGAGATCCTCCGGGAGTTCACCCTGCATAACCACGTGCGCGAGGAGGCCCAGGATTTC 542 

Qy 603 CTGAGTCAGTTCCGTCTACCCCGCACAGGGGACCGCCCCAGCACCTTCGTGGGGGTCCAC 662 

I I I I I I M I I Ml Mill I II M II II II I 

543 CTGCG GGGTCTGCGGGTGAACGGGAGCCGACCGAGTACCTACGTGGGGGTGCAC 596 

Qy 663 GTGCGCCGCGGGGACTATCTGCGTGTGATGCCCAAGCGCTGGAAGGGGGTGGTGGGTGAC 722 

I I M M II M II I M I Ml I II II II II II I I I M I M I M II II III 

Db 597 GTGCGCCGGGGGGACTACGTGCACGTGATGCCCAACGTGTGGAAGGGCGTGGTGGCCGAC 656 

Qy 723 GGCCGTTACCTCCAGCAGGCTATGGACTGGTTCCGGGCCCGATACGAAGCCCCCGTCTTT 782 

I I I M M I II M II I II M II I II II I M M I I f M II I II II M M 

Db 657 CGGCGGTACCTGGAGCAGGCCCTGGACTGGTTCCGGGCTCGCTACCGCTCCCCCGTCTTT 716 



Qy "793 GTGGTCACCAGCAACGGCATGGAGTGGTGCCGGAAGAACATCGACACCTCCCGGGGGGAC 842 

IIIIMM I II I I I II I I I I I I II II II II 

GTGGTCTCCAGCAACGGCATGGCCTGGTGTCGGGAAAACATCAATGCCTCGCGCGGCGAT 77 6 



Db 717 



Qy 343 GTGATCTTTGCTGGCGATGGGCGGGAGGCCGCGCCCGCCAGGGACTTTGCGCTGCTGGTG 902 

III I Mill III MM 11 Mill Mill IIMIIII I 

GTGGTGTTTGCCGGCAATGGCATCGAGGGCTCCCCCGCCAAAGACTTCGCGCTGCTCACG 836 



Db 777 
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o^v^xvij ocoii^ii ivcauita i^ctaiia lUi rvyyn^auuu v^o^^^uo miu ocaii^ii xvcauit ua-v:7-o^^... i a^c 



Qy 903 CAGTGCAACCACACCATCATGACCATTGGCACCTTCGGCTTCTGGGCCGCCTACCTGGCT 962 

Mill I I I II M I I I I I I I I I M I I I I I I I I I I I I I I I I I I I M I i I I ' 

Db 837 CAGTGTAACCACACTGTCATGACCATTGGCACGTTCGGGATCTGGGCCGCCTACCTTGCT 8 96 

Qy 963 GGTGGAGATACCATCTACTTGGCTAACTTCACCCTGCCCACTTCCAGCTTCCTGAAGATC 1022 

I II I I II I I I II I M I I I II I I I I I I I M t I II II I M I I I II 

Db 897 GGTGGAGAGACCATCTACCTGGCCAATTACACGCTCCCGGACTCTCCCTTCCTCAT^CTC 956 

Qy 1023 TTTAAACCCGAGGCTGCCTTCCTGCCCGAGTGGGTGGGCATTAATGCAGACTTGTCTCCA 1082 

Mill I II II I I I I I I I I I I I I I I I I I I I I I I II II I M I I II I I I I III 
Db 957 TTTAAGCCCGAGGCAGCCTTCCTGCCCGAGTGGATTGGGATCGAGGCAGACCTGTCCCCA 1016 

Qy 1083 CTCCAGATG 1091 

I I I I I I 
Db 1017 CTCCTTAAG 1025 



RESULT 8 

US-09-254-077A-5 

; Sequence 5, Application US/09254077A 

; Publication No. US200200314 94A1 

; GENERAL INFORMATION: 

; APPLICANT: SANDRIN, MAURO S. 

; APPLICANT: MCKENZIE, IAN C. F. 

; TITLE OF INVENTION: NUCLEIC ACIDS FOR REDUCING CARBOHYDRATE EPITOPES 
; FILE REFERENCE: 30562. 5USW0 

; CURRENT APPLICATION NUMBER: US/09/254 , 077A 
; CURRENT FILING DATE: 1999-06-11 

PRIOR APPLICATION NUMBER: PCT/AU97/00540 
; PRIOR FILING DATE: 1997-08-22 

PRIOR APPLICATION NUMBER: PO 1823 
; PRIOR FILING DATE: 1996-08-23 
; NUMBER OF SEQ ID NOS : 12 
; SOFTWARE: Patentin Ver . 2.1 
; SEQ ID NO 5 

LENGTH: 1043 
TYPE: DNA 

ORGANISM: Sus scrofa 
US-09-254-077A-5 



Query Match 35.8%; Score 454.2; DB 3; Length 1043; 

Best Local Similarity 71.8%; Pred. No. 2e-123; 

Matches 610; Conservative 0; Mismatches 2 33; Indels 6; Gaps 



Qy 243 GGGACCTGGACTATTTACCCGGATGGCCGGTTTGGGAACCAGATGGGACAGTATGCCACG 302 

Ml II II I I I I I I I I t I I I I I I I I I I I I I I I I I I I I I I II I 

Db 183 GGCATGTGGACGATCAATGCCATCGGCCGCCTGGGGAACCAGATGGGGGAGTACGCCACC 242 

Qy 303 CTGCTGGCCCTGGCGCAGCTCAACGGCCGCCAGGCCTTCATCCAGCCTGCCATGCACGCC 362 

Ml I I M I I I I I I I I t I II I I I II I I I I I I I I I I I I I I t I I I 
Db 243 CTGTACGCGCTGGCCAGGATGAACGGGCGGCCGGCCTTCATCCCGCCCGAGATGCACAGC 302 

Qy 363 GTCCTGGCCCCCGTGTTCCGCATCACGCTGCCTGTCCTGGCGCCCGAGGTAGACAGGCAC 422 

I II I I M I I I M I I I I I I I I I I I I I I I II I i III 

Db 303 ACGCTGGCCCCCATCTTCAGGATCACCCTCCCGGTCCTGCACGCCAGCACGGCCCGCAGG 362 

Qy 423 GCTCCTTGGCGGGAGCTGGAGCTTCACGACTGGATGTCCGAGGATTATGCCCACTTAAAG 482 

M M II I I I I I I I I I I I I I I I I III II 11 I I I I 

Db 363 ATCCCCTGGCAGAACTACCACCTGAACGACTGGATGGAGGAGCGGTACCGCCACATCCCG 422 

Qy 483 GAGCCCTGGCTGAAGCTCACCGGCTTCCCCTGCTCCTGGACCTTCTTCCACCACCTCCGG 542 
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o\^v/xNj_:, ocaii^ii xvcauiid j-^c^iaiia lui r\.yyii^aiiun w^ot^^uo aiiu ocaiwn xvcsuii ua-v/:7-oT*i-... x'agc i / ui -^:7 



Db 423 GGGGAGTACGTGCGCCTCACGGGCTACCCCTGCTCCTGGACCTTCTACCACCACCTGCGC 482 

Qy 543 GAGCAGATCCGCAGCGAGTTCACCCTGCACGACCACCTTCGGCAAGAGGCCCAGGGGGTA 602 

I I I I I I I I I I i I I I I I I I I I I I I I I I I I. I I I I I I I I I I I I I I 
Db 483 ACCGAGATCCTCCGGGAGTTCACCCTGCATAACCACGTGCGCGAGGAGGCCCAGGATTTC 542 

Qy 603 CTGAGTCAGTTCCGTCTACCCCGCACAGGGGACCGCCCCAGCACCTTCGTGGGGGTCCAC 662 

Mil Mill I I II I I I I I I I I I II I I I I I I I I I I I I 

Db 543 CTGCG GGGTCTGCGGGTGAACGGGAGCCGACCGAGTACCTACGTGGGGGTGCAC 596 

Qy 663 GTGCGCCGCGGGGACTATCTGCGTGTGATGCCC7UVGCGCTGGAAGGGGGTGGTGGGTGAC 722 

M I I I I II II I I I I I I III I I I I II II I I I I I I I I I I I I I I I I I I III 

Db 597 GTGCGCCGGGGGGACTACGTGCACGTGATGCCCAACGTGTGGAAGGGCGTGGTGGCCGAC 656 

Qy 723 GGCCGTTACCTCCAGCAGGCTATGGACTGGTTCCGGGCCCGATACGAAGCCCCCGTCTTT 782 

I I I I I I II I I II I M I I I I II II I M I I I I I I t I I I I I I I I I II II I 

Db 657 CGGCGGTACCTGGAGCAGGCCCTGGACTGGTTCCGGGCTCGCTACCGCTCCCCCGTCTTT 716 

Qy 783 GTGGTCACCAGCAACGGCATGGAGTGGTGCCGGAAGAACATCGACACCTCCCGGGGGGAC 842 

I I I I I I I I I I II I II t I I I I I I I I I I III I I I I I II I I I II II M II 
Db 717 GTGGTCTCCAGCAACGGCATGGCCTGGTGTCGGGAAAACATCAATGCCTCGCGCGGCGAT 776 

Qy 843 GTGATCTTTGCTGGCGATGGGCGGGAGGCCGCGCCCGCCAGGGACTTTGCGCTGCTGGTG 902 

I I I I I I I I I I I I I I I I I I II I I II I I I I I I I I I I I I t I I I I I I 

Db 777 GTGGTGTTTGCCGGCAATGGCATCGAGGGCTCCCCCGCCAAAGACTTCGCGCTGCTCACG 836 

Qy 903 CAGTGCAACCACACCATCATGACCATTGGCACCTTCGGCTTCTGGGCCGCCTACCTGGCT 962 

I I I I I I M I I I I I I I II II I I I II I I I I I I I I I I I I II I M I I I I I I I I I III 
Db 837 CAGTGTAACCACACTGTCATGACCATTGGCACGTTCGGGATCTGGGCCGCCTACCTTGCT 896 

Qy 963 GGTGGAGATACCATCTACTTGGCTAACTTCACCCTGCCCACTTCCAGCTTCCTGAAGATC 1022 

I I I I I I I I I I I I I I I I I I I I I II I I II I I I I II I I I M I I I II 

Db 8 97 GGTGGAGAGACCATCTACCTGGCCAATTACACGCTCCCGGACTCTCCCTTCCTCAAACTC 956 

Qy 1023 TTTAAACCCGAGGCTGCCTTCCTGCCCGAGTGGGTGGGCATTAATGCAGACTTGTCTCCA 1082 

I I I I t I I I I I I I I I I II I I I II II II I I I I I I I I I I I I I I I I I I I I I I I I 
Db 957 TTTAAGCCCGAGGCAGCCTTCCTGCCCGAGTGGATTGGGATCGAGGCAGACCTGTCCCCA 1016 

Qy 1083 CTCCAGATG 1091 

I I I I II 
Db 1017 CTCCTTAAG 1025 



RESULT 9 

US-09-954-456-45 

; Sequence 45, Application US/09954456 

; Patent No. US20020115057A1 

; GENERAL INFORMATION: 

; APPLICANT: Young, Paul 

; TITLE. OF INVENTION: Process for Identifying Anti-Cancer Therapeutic Agents Using Ca 

; TITLE OF INVENTION: Sets 

; FILE REFERENCE: 68 9290-76 

; CURRENT APPLICATION NUMBER: US/09/954,4 56 

; CURRENT FILING DATE: 2001-09-18 

; PRIOR APPLICATION NUMBER: US/60/233,617 

; PRIOR FILING DATE: 2000-09-18 

; PRIOR APPLICATION NUMBER: US/60/234,052 

; PRIOR FILING DATE: 2000-09-20 

; PRIOR APPLICATION NUMBER: US/60/234,923 

; PRIOR FILING DATE: 2000-09-25 
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o^^vyivjj, ocoiv^ii rvcauiud j^CLOiia lui r^yyiiK^aiisjn \jyo^^^\jo cuiKX ocan^ii ±vcduiL ua-u:7-o^-t... rage lo ui 



; PRIOR APPLICATION NUMBER: US/60/235,134 

; PRIOR FILING DATE: 2000-09-25 

; PRIOR APPLICATION NUMBER: US/60/235,637 

; PRIOR FILING DATE: 2000-09-26 

; PRIOR APPLICATION NUMBER: US/60/235,638 

; PRIOR FILING DATE: 2000-09-26 

; PRIOR APPLICATION NUMBER: US/60/235,711 

; PRIOR FILING DATE: 2000-09-27 

; PRIOR APPLICATION NUMBER: US/60/235,720 

; PRIOR FILING DATE: 2000-09-27 

; PRIOR APPLICATION NUMBER: US/60/235,840 

; PRIOR FILING DATE: 2000-09-27 

; PRIOR APPLICATION NUMBER: US/60/235,863 

; PRIOR FILING DATE: 2000-09-27 

; NUMBER OF SEQ ID NOS : 227 6 

SOFTWARE: Patentin version 3.0 
; SEQ ID NO 45 

LENGTH: 3088 

TYPE: DNA 

ORGANISM: Homo sapiens 
US-09-954-456-45 

Query Match 35.4%; Score 449.4; DB 3; Length 3088; i 

Best Local Similarity 71.5%; Pred. No. 6.8e-122; 

Matches 607; Conservative 0; Mismatches 236; Indels 6; Gaps 1; 

Qy 243 GGGACCTGGACTATTTACCCGGATGGCCGGTTTGGGAACCAGATGGGACAGTATGCCACG 302 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I t I I I I I I M I 

Db 295 GGGATGTGGACGATCAATGCAATAGGCCGCCTGGGGAACCAGATGGGCGAGTACGCCACA 354 

Qy 303 CTGCTGGCCCTGGCGCAGCTCAACGGCCGCCAGGCCTTCATCCAGCCTGCCATGCACGCC 362 

III I II I I I II I I I I I II I M I I I I I I I II I I I I I I II II I 

Db 355 CTGTACGCCCTGGCCAAGATGAACGGGCGGCCCGCCTTCATCCCGGCCCAGATGCACAGC 414 

Qy 363 GTCCTGGCCCCCGTGTTCCGCATCACGCTGCCTGTCCTGGCGCCCGAGGTAGACAGGCAC 422 

II I I I I I I I I I I I I I II I I I I II II II II I II I .1 M 

Db 415 ACCCTGGCCCCCATCTTCAGAATCACCCTGCCGGTGCTGCACAGCGCCACGGCCAGCAGG 474 

Qy ^ 423 GCTCCTTGGCGGGAGCTGGAGCTTCACGACTGGATGTCCGAGGATTATGCCCACTTAAAG 482 

I I I I I I I I III I I I I I I I I I II I I I I I I I I I I I I I I 

Db 475 ATCCCCTGGCAGAACTACCACCTGAACGACTGGATGGAGGAGGAATACCGCCACTTCCCG 534 

Qy 483 GAGCCCTGGCTGAAGCTCACCGGCTTCCCCTGCTCCTGGACCTTCTTCCACCACCTCCGG 542 

II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 535 GGGGAGTACGTCCGCTTCACCGGCTACCCCTGCTCCTGGACCTTCTACCACCACCTCCGC 594 

Qy 543 GAGCAGATCCGCAGCGAGTTCACCCTGCACGACCACCTTCGGCAAGAGGCCCAGGGGGTA 602 

I I I I I t I I I I I I I I I I I I I II I I I I I I I I I I I M I I I I II II I I II 
Db 595 CAGGAGATCCTCCAGGAGTTCACCCTGCACGACCACGTGCGGGAGGAGGCCCAGAAGTTC 654 

Qy 603 CTGAGTCAGTTCCGTCTACCCCGCACAGGGGACCGCCCCAGCACCTTCGTGGGGGTCCAC 662 

I I I I INI I III I I I I I I I 11 II I I I I I I I I I I I 

Db 655 CTGCG GGGCCTGCAGGTGTUVCGGGAGCCGGCCGGGCACCTTTGTAGGGGTCCAT 708 

Qy 663 GTGCGCCGCGGGGACTATCTGCGTGTGATGCCCAAGCGCTGGAAGGGGGTGGTGGGTGAC 722 

I I I I Ill I I I I I I I I I I I I I I I I I I I I I I I I I III 

Db 709 GTTCGCCGAGGGGACTATGTCCATGTCATGCCAAAAGTGTGGAAGGGGGTGGTGGCCGAC 768 

Qy 723 GGCCGTTACCTCCAGCAGGCTATGGACTGGTTCCGGGCCCGATACGAAGCCCCCGTCTTT 782 

I II I I I I I I I M I I I I I I I I I I I I I I I I I I I I I M I I I I I I I II 

Db 769 CGGCGATACCTACAGCAGGCCCTGGACTGGTTCCGAGCTCGCTACAGCTCCCTCATCTTC 828 
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o^v^ivij, ocaiL.li ivcauita A^ctaiia lui rvppiiL/auuu v^^onn^uo oiiu ocan^ii xvcauit ua-u^-o^^... ra^c ui 



Qy 

Db 


783 
829 


GTGGTCACCAGCAACGGCATGGAGTGGTGCCGGAAGAACATCGACACCTCCCGGGGGGAC 

Mil II MM Ill III iilllll IlilllMII II II 

GTGGTCACCAGTAATGGCATGGCCTGGTGTCGGGAGAACATTGArarrTrTr ArarTr a T 


Q AO 

Q Q Q 
OOO 


Qy 


843 


GTGATCTTTGCTGGCGATGGGCGGGAGGCCGCGCCCGCCAGGGACTTTGCGCTGCTGGTG 

III 1 IIIMI 1 MM 1 II II Mill II II 

GTGGTGTTTGCTGGCGATGGCATTGAGGGCTCACCTGCCAAflGATTTTarTrTnrTrnrn 


Q n O 


Db 


889 


y ft o 


Qy 


903 


CAGTGCAACCACACCATCATGACCATTGGCACCTTCGGCTTCTGGGCCGCCTACCTGGCT 

INN 1 M II 1 1 II 1 1 1 1 1 II II 1 1 M 1 II 1 1 1 II 1 1 1 1 M 1 M 1 Mill 1 

CAGTGTAACCACACCATCATGACCATTGGGACGTTCGGaATrTf:;af::rrrrziTnrrTrnrr 


Q ^ O 


Db 


949 


lUUo 


Qy 


963 


GGTGGAGATACCATCTACTTGGCTAACTTCACCCTGCCCACTTCCAGCTTCCTGAAGATC 
II 1 1 1 M 1 11 1 1 1 1 i 1 1 1 1 1 II 1 1 1 1 II 1 III 1 1 1 1 1 1 M II Ml 

GGC GGAG AC AC CATC TACC T GGC CAATTACAC Cr T r r r r f:; ar TP r r r T T r r T r n n n n T r* 


T n o o 


Db 


1009 


T n c o 


Qv 


1023 


J. J. xj^irtrt.v-L^v^vj^vjvjL/ X tjL^u X i^Vrf xtjv^uutjAtjlb^CjQTiGGGCATTAATGCAGACTTGTCTCCA 

INN 11 III II 1 1 1 M 1 1 1 1 II 1 1 1 1 1 1 II Ml MINI 1 1 1 1 II 

TTTAAGCCAGAGGCAGCCTTCCTGCCGGAGTGGACAGGGATTGCCGCAGACCTGTCCCCC 


1082 


Db 


1069 


1128 


Qy 


1083 


CTCCAGATG 1091 




Db 


1129 


1 1 1 1 
TTACTCAAG 1137 




RESULT 
US-09- 


10 

954-456- 


-1621 





Sequence 1621, Application US/09954456 
Patent No. US20020115057A1 
GENERAL INFORMATION: 
APPLICANT: Young, Paul 

TITLE OF INVENTION: Process for Identifying Anti-Cancer Therapeutic Agents Usinq Ca 
TITLE OF INVENTION: Sets 
FILE REFERENCE: 68 9290-76 

CURRENT APPLICATION NUMBER: US/09/954,4 56 
CURRENT FILING DATE: 2001-09-18 
PRIOR APPLICATION NUMBER: US/60/233,617 
PRIOR FILING DATE: 2000-09-18 
PRIOR APPLICATION NUMBER: US/60/234,052 
PRIOR FILING DATE: 2000-09-20 
PRIOR APPLICATION NUMBER: US/60/234,923 
PRIOR FILING DATE: 2000-09-25 
PRIOR APPLICATION NUMBER: US/60/235,134 
PRIOR FILING DATE: 2000-09-25 
PRIOR APPLICATION NUMBER; US/60/235,637 
PRIOR FILING DATE: 2000-09-26 
PRIOR APPLICATION NUMBER: US/60/235,638 
PRIOR FILING DATE: 2000-09-26 
PRIOR APPLICATION NUMBER: US/60/235,711 
PRIOR FILING DATE: 2000-09-27 
PRIOR APPLICATION NUMBER: US/60/235,720 
PRIOR FILING DATE: 2000-09-27 
PRIOR MPLICATION NUMBER: US/60/235,840 
PRIOR FILING DATE: 2000-09-27 , 
PRIOR APPLICATION NUMBER: US/60/235,863 
PRIOR FILING DATE: 2000-09-27 
NUMBER OF SEQ ID NOS : 2276 
SOFTWARE: PatentIn version 3.0 
SEQ ID NO 1621 
LENGTH: 3088 
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ov^vyl^x> ocaiL/ii xvcsuiid i^ciaiid lsji j^\jyn\^au\jiL u:7ot-r-^uo oiiu ocan^ii Avcauii ua-u:7-o^-T.. . x a^c ui -^:7 



TYPE: DNA 

ORGANISM: Homo sapiens 
US-09-954-456-1621 

Query Match 35.4%; Score 44 9.4; DB 3; Length 3088; 

Best Local Similarity 71.5%; Pred. No. 6.8e-122; 

Matches 607; Conservative 0; Mismatches 236; Indels 6; Gaps 1; 

Qy 243 GGGACCTGGACTATTTACCCGGATGGCCGGTTTGGGAACCAGATGGGACAGTATGCCACG 302 

I I I I I I I I I I I I I I I I I I I I M I I i I M I I I I I I I I I I I M I 

Db 295 GGGATGTGGACGATCAATGCAATAGGCCGCCTGGGGAACCAGATGGGCGAGTACGCCACA 354 

Qy 303 CTGCTGGCCCTGGCGCAGCTCAACGGCCGCCAGGCCTTCATCCAGCCTGCCATGCACGCC 362 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

355 CTGTACGCCCTGGCCAAGATGAACGGGCGGCCCGCCTTCATCCCGGCCCAGATGCACAGC 414 

Qy 363 GTCCTGGCCCCCGTGTTCCGCATCACGCTGCCTGTCCTGGCGCCCGAGGTAGACAGGCAC 422 

I I I I I I I I ri I I I I I M I I I I I I I I I I I I I II I I I I 

Db 415 ACCCTGGCCCCCATCTTCAGAATCACCCTGCCGGTGCTGCACAGCGCCACGGCCAGCAGG 474 

Qy , 423 GCTCCTTGGCGGGAGCTGGAGCTTCACGACTGGATGTCCGAGGATTATGCCCACTTAAAG 482 

I I I M I I I III M II I I I I II I I I I I I II I I I II I I 

Db 475 ATCCCCTGGCAGAACTACCACCTGAACGACTGGATGGAGGAGGAATACCGCCACTTCCCG 534 

Qy 483 GAGCCCTGGCTGAAGCTCACCGGCTTCCCCTGCTCCTGGACCTTCTTCCACCACCTCCGG 542 

II I I I I I t I M II I I I I I I I II I I M II I I I I I I II I I II I II II 

Db 535 GGGGAGTACGTCCGCTTCACCGGCTACCCCTGCTCCTGGACCTTCTACCACCACCTCCGC 594 

Qy 543 GAGCAGATCCGCAGCGAGTTCACCCTGCACGACCACCTTCGGCAAGAGGCCCAGGGGGTA 602 

I I I I I I I I I I I I II I I I I I M II I I I I I I I I I I I I I I II M I I I II 
Db 595 CAGGAGATCCTCCAGGAGTTCACCCTGCACGACCACGTGCGGGAGGAGGCCCAGAAGTTC 654 

Qy 603 CTGAGTCAGTTCCGTCTACCCCGCACAGGGGACCGCCCCAGCACCTTCGTGGGGGTCCAC 662 

MM I I I I I III I I I I I I I I I I I I I I M I I I I I I 

Db 655 CTGCG GGGCCTGCAGGTGAACGGGAGCCGGCCGGGCACCTTTGTAGGGGTCCAT 708 

Qy 663 GTGCGCCGCGGGGACTATCTGCGTGTGATGCCCAAGCGCTGGAAGGGGGTGGTGGGTGAC 722 

I I I I M I I I I I t I I I I t I I I I I I I I I I I I I I I I I II I I I I I I I I IN 

Db 709 GTTCGCCGAGGGGACTATGTCCATGTCATGCCAAAAGTGTGGAAGGGGGTGGTGGCCGAC 7 68 

Qy 723 GGCCGTTACCTCCAGCAGGCTATGGACTGGTTCCGGGCCCGATACGAAGCCCCCGTCTTT 782 

I I I M I II I I I I I I M I I I I I I I I I I I I I I I II I I I I I I I I I I I 

Db 769 CGGCGATACCTACAGCAGGCCCTGGACTGGTTCCGAGCTCGCTACAGCTCCCTCATCTTC 828 

Qy 783 GTGGTCACCAGCAACGGCATGGAGTGGTGCCGGAAGAACATCGACACCTCCCGGGGGGAC 842 

I M I I I I I M I II I II I I II 11 I I I III I I I I I I I I I I I I I I I I I II 11 
Db 829 GTGGTCACCAGTAATGGCATGGCCTGGTGTCGGGAGAACATTGACACCTCCCACGGTGAT 888 

Qy 843 GTGATCTTTGCTGGCGATGGGCGGGAGGCCGCGCCCGCCAGGGACTTTGCGCTGCTGGTG 902 

M I I I I I I I I I I II I I I I I II I I I I I I I I I I I I I I I I I I I I 

Db 889 GTGGTGTTTGCTGGCGATGGCATTGAGGGCTCACCTGCCAAAGATTTTGCTCTACTCACA 948 

Qy 903 CAGTGCAACCACACCATCATGACCATTGGCACCTTCGGCTTCTGGGCCGCCTACCTGGCT 962 

Mill IIIIIIMMM I Ill Nlll I 

Db 94 9 CAGTGTAACCACACCATCATGACCATTGGGACGTTCGGGATCTGGGCCGCATACCTCACG 1008 

Qy 963 GGTGGAGATACCATCTACTTGGCTAACTTCACCCTGCCCACTTCCAGCTTCCTGAAGATC 1022 

M M MM M I II II II III III II III 

Db 1009 GGCGGAGACACCATCTACCTGGCCAATTACACCCTCCCCGACTCCCCTTTCCTCAAAATC 1068 

Qy 1023 TTTAAACCCGAGGCTGCCTTCCTGCCCGAGTGGGTGGGCATTAATGCAGACTTGTCTCCA 1082 
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Db 1069 TTTAAGCCAGAGGCAGCCTTCCTGCCGGAGTGGACAGGGATTGCCGCAGACCTGTCCCCC 1128 

Qy 1083 CTCCAGATG 1091 

I I I I 
Db 1129 TTACTCAAG 1137 



RESULT 11 
US-09-969-347-234 

; Sequence 234, Application US/09969347 

; Patent No. US20020115085A1 

; GENERAL INFORMATION: 

; APPLICANT: Ebner, Reinhard 

; TITLE OF INVENTION: Cancer Gene Determination and Therapeutic Screening Using Signa 
; TITLE OF INVENTION: Sets 
; FILE REFERENCE: 68 92 90-69 

; CURRENT APPLICATION NUMBER: US/09/969,347 
; CURRENT FILING DATE: 2001-10-02 

PRIOR APPLICATION NUMBER: US/60/237,598 
; PRIOR FILING DATE: 2000-10-03 
; PRIOR APPLICATION NUMBER: US/60/237,604 
; PRIOR FILING DATE: 2000-10-03 
; NUMBER OF SEQ ID NOS : 318 

SOFTWARE: Patentin version 3.0 
; SEQ ID NO 234 
LENGTH: 3088 
TYPE: DNA 

ORGANISM: Homo sapiens 
US-09-969-347-234 

Query Match 35.4%; Score 449.4; DB 3; Length 3088; 

Best Local Similarity 71.5%; Pred. No. 6.8e-122; 

Matches 607; Conservative 0; Mismatches 236; Indels 6; Gaps 1; 

Qy 243 GGGACCTGGACTATTTACCCGGATGGCCGGTTTGGGAACCAGATGGGACAGTATGCCACG 302 

Mil II I II I I I I I I I I I I I I I I j I I M I I I I I I I I I I I I I I 

2 95 GGGATGTGGACGATCAATGCAATAGGCCGCCTGGGGAACCAGATGGGCGAGTACGCCACA 354 

Qy 303 CTGCTGGCCCTGGCGCAGCTCAACGGCCGCCAGGCCTTCATCCAGCCTGCCATGCACGCC 362 

Ml I I I I II M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 355 CTGTACGCCCTGGCCAAGATGAACGGGCGGCCCGCCTTCATCCCGGCCCAGATGCACAGC 414 

Qy 363 GTCCTGGCCCCCGTGTTCCGCATCACGCTGCCTGTCCTGGCGCCCGAGGTAGACAGGCAC 422 

I I I I I I I I I I I I M I I I I I I M I I I I I I I I M I I I I 

Db 415 ACCCTGGCCCCCATCTTCAGAATCACCCTGCCGGTGCTGCACAGCGCCACGGCCAGCAGG 474 

Qy 423 GCTCCTTGGCGGGAGCTGGAGCTTCACGACTGGATGTCCGAGGATTATGCCCACTTAAAG 482 

I I I I I I I I III I I I II I I I M I I I I I I I I I I I I I I I 

Db 475 ATCCCCTGGCAGAACTACCACCTGAACGACTGGATGGAGGAGGAATACCGCCACTTCCCG 534 

Qy 483 GAGCCCTGGCTGAAGCTCACCGGCTTCCCCTGCTCCTGGACCTTCTTCCACCACCTCCGG 542 

II I I I I I I I I I M I I I M I I I M I I I I I I I I I I I I I I I I I I I I I I 

Db 535 GGGGAGTACGTCCGCTTCACCGGCTACCCCTGCTCCTGGACCTTCTACCACCACCTCCGC 594 

Qy 543 GAGCAGATCCGCAGCGAGTTCACCCTGCACGACCACCTTCGGCAAGAGGCCCAGGGGGTA 602 

M I M I II I M M I I I I I I I I I I I I I I I I I I I I I I I I I I I I i I I II 
Db 595 CAGGAGATCCTCCAGGAGTTCACCCTGCACGACCACGTGCGGGAGGAGGCCCAGAAGTTC 654 

Qy 603 CTGAGTCAGTTCCGTCTACCCCGCACAGGGGACCGCCCCAGCACCTTCGTGGGGGTCCAC 662 

MM I I I I I Ml I I I I I I I M II I I I I II I i I II 
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o\^v^x\j-> xvc^auiLd ivcioii^ lui rvp^ii^auwii u^ui-t^uo oixu ocai^ii xvc^auii u»-v:7-o*t*t. 



Db 

Qy 

Db 

Qy 

Db 

Qy 

Db 

Qy 

Db 

Qy 

Db 

Qy 

Db 

Qy 

Db 

Qy 

Db 



655 CTGCG- 



663 



709 



723 



769 



783 



829 



843 



889 



903 



949 



963 



-GGGCCTGCAGGTGAACGGGAGCCGGCCGGGCACCTTTGTAGGGGTCCAT 708 



GTGCGCCGCGGGGACTATCTGCGTGTGATGCCCAAGCGCTGGAAGGGGGTGGTGGGTGAC 722 

I I I I I I I I Ill I I I I M I I M 

GTTCGCCGAGGGGACTATGTCCATGTCATGCCAAAAGTGTGGAAGGGGGTGGTGGCCGAC 7 68 

GGCCGTTACCTCCAGCAGGCTATGGACTGGTTCCGGGCCCGATACGAAGCCCCCGTCTTT 782 

I M I M II I I I II I I I I I I I I I I I II II I M I I I I I I I I I I I I I 

CGGCGATACCTACAGCAGGCCCTGGACTGGTTCCGAGCTCGCTACAGCTCCCTCATCTTC 828 

GTGGTCACCAGCAACGGCATGGAGTGGTGCCGGAAGAACATCGACACCTCCCGGGGGGAC 842 
I I I I I I I I I I I II M I I I I I I I I I I III I I I t I I I I I I I I I II I I II II 
GTGGTCACCAGTAATGGCATGGCCTGGTGTCGGGAGAACATTGACACCTCCCACGGTGAT 888 

GTGATCTTTGCTGGCGATGGGCGGGAGGCCGCGCCCGCCAGGGACTTTGCGCTGCTGGTG 902 
I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I II II II I I 

GTGGTGTTTGCTGGCGATGGCATTGAGGGCTCACCTGCCAAAGATTTTGCTCTACTCACA 948 

CAGTGCAACCACACCATCATGACCATTGGCACCTTCGGCTTCTGGGCCGCCTACCTGGCT 9 62 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I 
CAGTGTAACCACACCATCATGACCATTGGGACGTTCGGGATCTGGGCCGCATACCTCACG 1008 



GGTGGAGATACCATCTACTTGGCTAACTTCACCCTGCCCACTTCCAGCTTCCTGAAGATC 1022 
II I I I I I I I I I I I I I I I I I I II I I I I I I I III III I I I I I II III 
1009 GGCGGAGACACCATCTACCTGGCCAATTACACCCTCCCCGACTCCCCTTTCCTCAAAATC 1068 

1023 TTTAAACCCGAGGCTGCCTTCCTGCCCGAGTGGGTGGGCATTAATGCAGACTTGTCTCCA 1082 

I I I I I II I I I I I I I 1 I I I I I I I I I I t I 11 Mill I I I I I t I I I I II 
1069 TTTAAGCCAGAGGCAGCCTTCCTGCCGGAGTGGACAGGGATTGCCGCAGACCTGTCCCCC 1128 

1083 CTCCAGATG 1091 

I I I I 
1129 TTACTCAAG 1137 



RESULT 12 

US-10-843-641A-3072 

Sequence 3072, Application US/10843641A 
Publication No. US200500644 54A1 
GENERAL INFORMATION: 
APPLICANT: Avalon Pharmaceuticals, Inc. 

TITLE OF INVENTION: Cancer Gene Determination and Therapeutic Screening Using 
TITLE OF INVENTION: Signature Gene Sets 
FILE REFERENCE: 689290-189 

CURRENT APPLICATION NUMBER: US/10/843 , 641A 
CURRENT FILING DATE: 2004-05-12 
PRIOR APPLICATION NUMBER: US/09/873,367 
PRIOR FILING DATE: 2001-06-05 
PRIOR APPLICATION NUMBER: US/09/954,531. 
PRIOR FILING DATE: 2001-09-18 
PRIOR APPLICATION NUMBER: US/09/954,456 
PRIOR FILING DATE: 2001-09-25 
PRIOR APPLICATION NUMBER: US/09/962,436 
PRIOR FILING DATE: 2001-09-25 
PRIOR APPLICATION NUMBER: US/09/962,832 
PRIOR FILING DATE: 2001-09-25 
PRIOR APPLICATION NUMBER: US/09/964,824 
PRIOR FILING DATE: 2001-09-27 
PRIOR APPLICATION NUMBER: US/09/967,768 
PRIOR FILING DATE: 2001-09-28 
PRIOR APPLICATION NUMBER: US/09/968,007 
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ov^wivu, ocai\^ii xvcauxLd j^cuuia aux i-vpjjiivauuii v/:7 0'-mz.uo oiiu ocaiv^ix xvcduii ua-u:7-o^^. 



; PRIOR FILING DATE: 2001-10-02 

; PRIOR APPLICATION NUMBER: US/09/969,34 7 

; PRIOR FILING DATE: 2001-10-02 

; PRIOR APPLICATION NUMBER: US/09/969,708 

; PRIOR FILING DATE: 2001-10-03 

Remaining Prior Application data removed - See File Wrapper or PALM. 
; NUMBER OF SEQ ID NOS : 844 7 
; SOFTWARE: PatentIn version 3.0 
; SEQ ID NO 3072 

LENGTH: 3088 

TYPE: DNA 

ORGANISM: Homo sapiens 
US-10-843-641A-3072 



Query Match 35.4%; Score 449.4; DB 10; Length 3088; 

Best Local Similarity 71.5%; Pred. No. 6.8e-122; 

Matches 607; Conservative 0; Mismatches 236; Indels 6; Gaps 1; 



Qy 


243 


GGGACCTGGACTATTTACCCGGATGGCCGGTTTGGGAACCAGATGGGACAGTATGCCACG 
till 1 1 1 1 1 II J 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 t III 

Nil 1 1 1 1 1 II 1 1 II II 1 1 II II II 1 1 1 1 1 II 1 MM Mill 

GGGATGTGGACGATCAATGCAATAGGCCGCCTGGGGAACCAGATGGGCGAGTACGCCACA 


302 


Db 


295 


354 


Qy 


303 


CTGCTGGCCCTGGCGCAGCTCAACGGCCGCCAGGCCTTCATCCAGCCTGCCATGCACGCC 

III 1 1 1 1 1 1 1 1 II 1 1 1 1 t 1 II 1 r 1 1 1 1 1 1 1 1 1 11 1 t 1 1 1 ■ 1 

III 1 1 I II 1 1 1 II 1 1 1 1 II II 1 1 1 II M 1 II 1 1 1 M 1 1 1 1 1 

CTGTACGCCCTGGCCAAGATGAACGGGCGGCCCGCCTTCATCCCGGCCCAGATGCACAGC 


362 


Db 


355 


414 


Qy 


363 


GTCCTGGCCCCCGTGTTCCGCATCACGCTGCCTGTCCTGGCGCCCGAGGTAGACAGGCAC 

1 1 1 1 1 1 1 1 1 1 1 III 1 Mill 1 1 1 1 1 II r 1 1 II I III 

N 1 1 1 1 1 1 f 1 1 III 1 1 M 1 1 1 II 1 1 II III M 1 III 
ACCCTGGCCCCCATCTTCAGAATCACCCTGCCGGTGCTGCACAGCGCCACGGCCAGCAGG 


422 


Db 


415 


474 


Qy 


423 


GCTCCTTGGCGGGAGCTGGAGCTTCACGACTGGATGTCCGAGGATTATGCCCACTTAAAG 

11 [III 1 1 j II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 It 1 1 

> ■ f ■ > 1 1 1 III II 1 1 II II 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 

ATCCCCTGGCAGAACTACCACCTGAACGACTGGATGGAGGAGGAATACCGCCACTTCCCG 


482 


Db 


475 


534 


Qy 


483 


GAGCCCTGGCTGAAGCTCACCGGCTTCCCCTGCTCCTGGACCTTCTTCCACCACCTCCGG 

1 1 1 1 i II II t t 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 t t 1 t 1 
' ■ f I 1 1 1 1 1 t II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 II M 1 1 1 

GGGGAGTACGTCCGCTTCACCGGCTACCCCTGCTCCTGGACCTTCTACCACCACCTCCGC 


542 


Db 


535 


594 


Qy 

Db 


543 
595 


GAGCAGATCCGCAGCGAGTTCACCCTGCACGACCACCTTCGGCAAGAGGCCCAGGGGGTA 

1 1 M M II 1 1 1 M M 1 1 1 II II II 1 II II 1 1 1 M 1 1 1 II II M 1 II 

CAGGAGATCCTCCAGGAGTTCACCCTGCACGACCACGTGCGGGAGGAGGCCCAGAAGTTC 


602 
654 


Qy 


603 


CTGAGTCAGTTCCGTCTACCCCGCACAGGGGACCGCCCCAGCACCTTCGTGGGGGTCCAC 

INI 1 II 1 1 Ml MM M II IIIIMM 

CTGCG GGGCCTGCAGGTGAACGGGAGCCGGCCGGGCACCTTTGTAGGGGTCCAT 


662 


Db 


655 


708 


Qy 


663 


GTGCGCCGCGGGGACTATCTGCGTGTGATGCCCAAGCGCTGGAAGGGGGTGGTGGGTGAC 

1 1 1 1 1 1 1 1 M 1 1 1 M 1 1 1 1 IIIMIIIMIIIIII Ml 

GTTCGCCGAGGGGACTATGTCCATGTCATGCCAAAAGTGTGGAAGGGGGTGGTGGCCGAC 


722 


Db 


709 


768 


Qy 


723 


GGCCGTTACCTCCAGCAGGCTATGGACTGGTTCCGGGCCCGATACGAAGCCCCCGTCTTT 

1 1 1 1 M 1 1 II 1 1 1 1 II 1 II 1 1 M 1 M M 1 M II 1 1 1 IIIIMM 

CGGCGATACCTACAGCAGGCCCTGGACTGGTTCCGAGCTCGCTACAGCTCCCTCATCTTC 


782 


Db 


769 


828 


Qy 


783 


GTGGTCACCAGCAACGGCATGGAGTGGTGCCGGAAGAACATCGACACCTCCCGGGGGGAC 

NNMIIill M lllllll Mill III MIMII IIIIIMIII II II 

GTGGTCACCAGTAATGGCATGGCCTGGTGTCGGGAGAACATTGACACCTCCCACGGTGAT 


842 


Db 


829 


888 


Qy 


843 


GTGATCTTTGCTGGCGATGGGCGGGAGGCCGCGCCCGCCAGGGACTTTGCGCTGCTGGTG 

IN 1 IIIIIMMIIIII MM 1 1 II MM 1 M 

GTGGTGTTTGCTGGCGATGGCATTGAGGGCTCACCTGCCAAAGATTTTGCTCTACTCACA 


902 


Db 


889 


948 
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ov^vyivj^ ocoiL^ii Avcauiia i^cuuia lui r^ppin^auuii v^o^-t^uo oiiu ocan^ii xvcauit ud-u>r-o-r-t... r a^c ^-r ui 



903 CAGTGCAACCACACCATCATGACCATTGGCACCTTCGGCTTCTGGGCCGCCTACCTGGCT 962 

Mill I II M I II I II II M II I I II II M I I M I II II I I M II I I I I I I 

949 CAGTGTAACCACACCATCATGACCATTGGGACGTTCGGGATCTGGGCCGCATACCTCACG 1008 
963 GGTGGAGATACCATCTACTTGGCTAACTTCACCCTGCCCACTTCCAGCTTCCTGAAGATC 1022 

I I I I I I I I I I I I II M I I II I I I [| I II 11 I I Ml II I I I I I I I I 

1009 GGCGGAGACACCATCTACCTGGCCAATTACACCCTCCCCGACTCCCCTTTCCTCAAAATC 1068 
1023 TTTAT^CCCGAGGCTGCCTTCCTGCCCGAGTGGGTGGGCATTAATGCAGACTTGTCTCCA 1082 

Mill II I II II I I I I II I I I I I I I I M I M III I I I I I I I I I I II 

1069 TTTAAGCCAGAGGCAGCCTTCCTGCCGGAGTGGACAGGGATTGCCGCAGACCTGTCCCCC 1128 

1083 CTCCAGATG 1091 

I I I I 
1129 TTACTCAAG 1137 



RESULT 13, 
US-10-843-641A-4 648 

; Sequence 4648, Application US/10843641A 
; Publication No. US20050064454A1 
; GENERAL INFORMATION: 

; APPLICANT: Avalon Pharmaceuticals, Inc. 

; TITLE OF INVENTION: Cancer Gene Determination and Therapeutic Screening Usinq 
; TITLE OF INVENTION: Signature Gene Sets 
; FILE REFERENCE: 689290-189 

; CURRENT APPLICATION NUMBER: US/10/8 43, ^41A 

; CURRENT FILING DATE: 2004-05-12 

; PRIOR APPLICATION NUMBER: US/09/873,367 

; PRIOR FILING DATE: 2001-06-05 

; PRIOR APPLICATION NUMBER: US/09/954,531 

PRIOR FILING DATE: 2001-09-18 
; PRIOR APPLICATION NUMBER: US/09/954,456 
; PRIOR FILING DATE: 2001-09-25 
; PRIOR APPLICATION NUMBER: US/09/962,436 
; PRIOR FILING DATE: 2001-09-25 
; PRIOR APPLICATION NUMBER: US/09/962,832 
; PRIOR FILING DATE: 2001-09-25 
; PRIOR APPLICATION NUMBER: US/09/964,824 

PRIOR FILING DATE: 2001-09-27 
; PRIOR APPLICATION NUMBER: US/0 9/967,7 68 
; PRIOR FILING DATE: 2001-09-28 
; PRIOR APPLICATION NUMBER: US/0 9/968,007 
; PRIOR FILING DATE: 2001-10-02 
; PRIOR APPLICATION NUMBER: US/09/969,347 
; PRIOR FILING DATE: 2001-10-02 
; PRIOR APPLICATION NUMBER: US/09/969,708 
; PRIOR FILING DATE: 2001-10-03 

Remaining Prior Application data removed - See File Wrapper or PALM. 

NUMBER OF SEQ ID NOS : 8447 
; SOFTWARE: Patentin version 3.0 
; SEQ ID NO 4 648 
LENGTH: 3088 
TYPE: DNA 

ORGANISM: Homo sapiens 
US-10-843-641A-4648 

Query Match 35.4%; Score 449.4; DB 10; Length 3088; 

Best Local Similarity 71.5%; Pred. No. 6.8e-122; 

Matches 607; Conservative 0; Mismatches 236; Indels 6; Gaps 1; 



Qy 

Db 

Qy 

Db 

Qy 

Db 

Qy 

Db 
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ov^v^xvu ocmv/ii ivcauita x^cuuia lui r^yyus^ausjii kj:7<j^-^^\jo oiiu ocan^u ivcauiL ua-u:7-o-r-r.. . ra^c ui 



Qy 243 GGGACCTGGACTATTTACCCGGATGGCCGGTTTGGGAACCAGATGGGACAGTATGCCACG 302 

I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

295 GGGATGTGGACGATCAATGCAATAGGCCGCCTGGGGAACCAGATGGGCGAGTACGCCACA 354 

Qy 303 CTGCTGGCCCTGGCGCAGCTCAACGGCCGCCAGGCCTTCATCCAGCCTGCCATGCACGCC 362 

III I II I II II II I I I I I I I I I I I I I I I I I I I I I I I I I M I 

355 CTGTACGCCCTGGCCAAGATGAACGGGCGGCCCGCCTTCATCCCGGCCCAGATGCACAGC 414 

Qy 363 GTCCTGGCCCCCGTGTTCCGCATCACGCTGCCTGTCCTGGCGCCCGAGGTAGACAGGCAC 422 

M I M I t I I I I I I I I I I I I I I I I I I I I M I II I I I I 

415 ACCCTGGCCCCCATCTTCAGAATCACCCTGCCGGTGCTGCACAGCGCCACGGCCAGCAGG 474 

Qy 423 GCTCCTTGGCGGGAGCTGGAGCTTCACGACTGGATGTCCGAGGATTATGCCCACTTAAAG 482 

M I I I I I I III I I II I I I I I II I I I I t I I I I I I I I I 

475 ATCCCCTGGCAGAACTACCACCTGAACGACTGGATGGAGGAGGAATACCGCCACTTCCCG 534 

Qy 483 GAGCCCTGGCTGAAGCTCACCGGCTTCCCCTGCTCCTGGACCTTCTTCCACCACCTCCGG 542 

II I I I I I M I II I I M I I I I II I I I I I I I I I I I It M I I I I II I I 

Db 535 GGGGAGTACGTCCGCTTCACCGGCTACCCCTGCTCCTGGACCTTCTACCACCACCTCCGC 594 

Qy 543 GAGCAGATCCGCAGCGAGTTCACCCTGCACGACCACCTTCGGCAAGAGGCCCAGGGGGTA 602 

I I I I I I I I I I I I I I I II I II I II I I I II I I I I II I II I I II I II II 
I^b 595 CAGGAGATCCTCCAGGAGTTCACCCTGCACGACCACGTGCGGGAGGAGGCCCAGAAGTTC 654 

Qy 603 CTGAGTCAGTTCCGTCTACCCCGCACAGGGGACCGCCCCAGCACCTTCGTGGGGGTCCAC 662 

INI Mil I III I II I I t I II I 1 I II I I I I I II I 

Db 655 CTGCG GGGCCTGCAGGTGAACGGGAGCCGGCCGGGCACCTTTGTAGGGGTCCAT 708 

Qy 663 GTGCGCCGCGGGGACTATCTGCGTGTGATGCCCAAGCGCTGGAAGGGGGTGGTGGGTGAC 722 

I I N I I I I I I It I I II I I I I I I II M I I It I I I I 1 I 1 it I I M I III 

Db 709 GTTCGCCGAGGGGACTATGTCCATGTCATGCCAAAAGTGTGGAAGGGGGTGGTGGCCGAC 768 

Qy 723 GGCCGTTACCTCCAGCAGGCTATGGACTGGTTCCGGGCCCGATACGAAGCCCCCGTCTTT 782 

I I I I I I M I II M I I I I I I I I I I I I I I I I I I I I M I I M I I I I I 

Db 769 CGGCGATACCTACAGCAGGCCCTGGACTGGTTCCGAGCTCGCTACAGCTCCCTCATCTTC 828 

Qy 783 GTGGTCACCAGCAACGGCATGGAGTGGTGCCGGAAGAACATCGACACCTCCCGGGGGGAC 842 

I I I I I I I I M I II I I I I I I I M I t I III I I I I I I I I I I II t I II I It II 
Db 829 GTGGTCACCAGTAATGGCATGGCCTGGTGTCGGGAGAACATTGACACCTCCCACGGTGAT 888 

Qy 843 GTGATCTTTGCTGGCGATGGGCGGGAGGCCGCGCCCGCCAGGGACTTTGCGCTGCTGGTG 902 

Ml I lllllll I III lilt II Mill II II 

I^b 889 GTGGTGTTTGCTGGCGATGGCATTGAGGGCTCACCTGCCAAAGATTTTGCTCTACTCACA 948 

Qy 903 CAGTGCAACCACACCATCATGACCATTGGCACCTTCGGCTTCTGGGCCGCCTACCTGGCT 962 

I I I I I M I I I I I t I I I I I I I 1 1 I I M M M I I I M I I I I I I I I 11 I I I I I I 

I^b 949 CAGTGTAACCACACCATCATGACCATTGGGACGTTCGGGATCTGGGCCGCATACCTCACG 1008 

Qy 963 GGTGGAGATACCATCTACTTGGCTAACTTCACCCTGCCCACTTCCAGCTTCCTGAAGATC 1022 

II ii Mill 11 INI II I I I MM II III 

Db 1009 GGCGGAGACACCATCTACCTGGCCAATTACACCCTCCCCGACTCCCCTTTCCTCAAAATC 1068 

Qy 1023 TTTAAACCCGAGGCTGCCTTCCTGCCCGAGTGGGTGGGCATTAATGCAGACTTGTCTCCA 1082 

I M I I II Mill I II II I I M I I I M M I II III I I I I I I I I I I M 

Db 1069 TTTAAGCCAGAGGCAGCCTTCCTGCCGGAGTGGACAGGGATTGCCGCAGACCTGTCCCCC 1128 

Qy 1083 CTCCAGATG 1091 

I I I I 
Db 1129 TTACTCAAG 1137 
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RESULT 14 

US-10-843-641A-8363 

; Sequence 8363, Application US/10843641A 
; Publication No. US20050064454A1 
; GENERAL INFORMATION: 

;. APPLICANT: Avalon Pharmaceuticals, Inc. 

TITLE OF INVENTION: Cancer Gene Determination and Therapeutic Screening Using 
; TITLE OF INVENTION: Signature Gene Sets 
; FILE REFERENCE: 68 9290-189 

; CURRENT APPLICATION NUMBER: US/10/843 , 641A 

; CURRENT FILING DATE: 2004-05-12 

; PRIOR APPLICATION NUMBER: US/09/873,367 

; PRIOR FILING DATE: 2001-06-05 

; PRIOR APPLICATION NUMBER: US/09/954,531 

; PRIOR FILING DATE: 2001-09-18 

; PRIOR APPLICATION NUMBER: US/09/954,456 

; PRIOR FILING DATE: 2001-09-25 

; PRIOR APPLICATION NUMBER: US/09/962,436 

; PRIOR FILING DATE: 2001-09-25 

; PRIOR APPLICATION NUMBER: US/09/962,832 

; PRIOR FILING DATE: 2001-09-25 

; PRIOR APPLICATION NUMBER: US/09/964,824 

; PRIOR FILING DATE: 2001-09-27 

; PRIOR APPLICATION NUMBER: US/09/967,768 

; PRIOR FILING DATE: 2001-09-28 

; PRIOR APPLICATION NUMBER: US/09/968,007 

; PRIOR FILING DATE: 2001-10-02 

; PRIOR APPLICATION NUMBER: US/09/969,347 

PRIOR FILING DATE: 2001-10-02 

PRIOR APPLICATION NUMBER: US/09/969,708 
; PRIOR FILING DATE: 2001-10-03 

; Remaining Prior Application data removed - See File Wrapper or PALM. 
; NUMBER OF SEQ ID NOS : 8447 

SOFTWARE: Patentin version 3.0 
; SEQ ID NO 8363 

LENGTH: 3088 

TYPE: DNA 

ORGANISM: Homo sapiens 
US-10-843-641A-8363 



Query Match 35.4%; Score 44 9.4; DB 10; Length 308 8; 

Best Local Similarity 71.5%; Pred. No. 6.8e-122; 

Matches 607; Conservative 0; Mismatches 236; Indels 6; Gaps 1; 



Qy 


243 


GGGACCTGGACTATTTACCCGGATGGCCGGTTTGGGAACCAGATGGGACAGTATGCCACG 
1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 I 1 1 1 1 1 M 1 I 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 


302 


Db 


295 


GGGATGTGGACGATCAATGCAATAGGCCGCCTGGGGAACCAGATGGGCGAGTACGCCACA 


354 


Qy 


303 


CTGCTGGCCCTGGCGCAGCTCAACGGCCGCCAGGCCTTCATCCAGCCTGCCATGCACGCC 
Ml 1 1 1 1 f 1 1 1 i 1 1 1 1 1 1 1 1 1 1 1 1 1 1 i 1 1 1 1 1 1 1 1 1 1 1 1 1 1 
CTGTACGCCCTGGCCAAGATGAACGGGCGGCCCGCCTTCATGCCGGCCCAGATGCACAGC 


362 


Db 


355 


414 


Qy 


363 


GTCCTGGCCCCCGTGTTCCGCATCACGCTGCCTGTCCTGGCGCCCGAGGTAGACAGGCAC 

1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 M II 1 1 1 1 
ACCCTGGCCCCCATCTTCAGAATCACCCTGCCGGTGCTGCACAGCGCCACGGCCAGCAGG 


422 


Db 


415 


474 


Qy 


423 


GCTCCTTGGCGGGAGCTGGAGCTTCACGACTGGATGTCCGAGGATTATGCCCACTTAAAG 
1 1 1 1 1 1 1 1 III 1 M M II 1 1 1 1 1 1 1 1 1 11 1 1 1 1 1 1 1 


482 


Db 


475 


ATCCCCTGGCAGAACTACCACCTGAACGACTGGATGGAGGAGGAATACCGCCACTTCCCG 


534 
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ov^vyrvu. oc;aiVii xvcauita i^cioiid lsji r^yyiis^au^Jii \jyo^-^^^o oiiu ocaii^ii xvcauiL ua-vy:7-on-t.. . ra^c / vji 



Qy 


483 


GAGCCCTGGCTGAAGCTCACCGGCTTCCCCTGCTCCTGGACCTTCTTCCACCACCTCCGG 

II 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 II 1 1 1 1 1 1 1 1 1 M 1 1 1 

GGGGAGTACGTCCGCTTCACCGGCTACCCCTGCTCCTGGACCTTCTACCACCACCTCCGC 


542 


Db 


535 


594 


Qy 


543 


GAGCAGATCCGCAGCGAGTTCACCCTGCACGACCACCTTCGGCAAGAGGCCCAGGGGGTA 

M M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 II 

CAGGAGATCCTCCAGGAGTTCACCCTGCACGACCACGTGCGGGAGGAGGCCCAGAAGTTC 


602 


Db 


595 


654 


Qy 


603 


CTGAGTCAGTTCCGTCTACCCCGCACAGGGGACCGCCCCAGCACCTTCGTGGGGGTCCAC 
1 1 1 1 INI 1 III 1 1 1 II 1 1 II 1 M 1 1 1 11 1 1 1 1 1 
CTGCG GGGCCTGCAGGTGAACGGGAGCCGGCCGGGCACCTTTGTAGGGGTCCAT 


662 


Db 


655 


708 


Qy 


663 


GTGCGCCGCGGGGACTATCTGCGTGTGATGCCCAAGCGCTGGAAGGGGGTGGTGGGTGAC 
1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 I 1 1 1 1 1 1 1 1 t II 1 1 1 1 1 1 1 1 1 M III 
GTTCGCCGAGGGGACTATGTCCATGTCATGCCAAAAGTGTGGAAGGGGGTGGTGGCCGAC 


722 


Db 


709 


768 


Qy 


723 


GGCCGTTACCTCCAGCAGGCTATGGACTGGTTCCGGGCCCGATACGAAGCCCCCGTCTTT 

1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 

CGGCGATACCTACAGCAGGCCCTGGACTGGTTCCGAGCTCGCTACAGCTCCCTCATCTTC 


782 


Db 


769 


828 


Qy 


783 


GTGGTCACCAGCAACGGCATGGAGTGGTGCCGGAAGAACATCGACACCTCCCGGGGGGAC 
IIIIIIIIIM II IIMIII 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II II 
GTGGTCACCAGTAATGGCATGGCCTGGTGTCGGGAGAACATTGACACCTCCCACGGTGAT 


842 


Db 


829 


888 


Qy 


843 


GTGATCTTTGCTGGCGATGGGCGGGAGGCCGCGCCCGCCAGGGACTTTGCGCTGCTGGTG 
1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 It II 1 1 II 1 1 1 1 1 II II 
GTGGTGTTTGCTGGCGATGGCATTGAGGGCTCACCTGCCAAAGATTTTGCTCTACTCACA 


902 


Db 


889 


948 


Qy 

Db 


903 
949 


CAGTGCAACCACACCATCATGACCATTGGCACCTTCGGCTTCTGGGCCGCCTACCTGGCT 

1 1 1 1 1 1 1 1 1 1 11 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

CAGTGTAACCACACCATCATGACCATTGGGACGTTCGGGATCTGGGCCGCATACCTCACG 


962 
1008 


Qy 


963 


GGTGGAGATACCATCTACTTGGCTAACTTCACCCTGCCCACTTCCAGCTTCCTGAAGATC 
II 1 1 1 M 1 1 1 1 1 1 1 t 1 1 1 1 1 M 1 1 1 1 1 1 1 III III Mill II III 
GGCGGAGACACCATCTACCTGGCCAATTACACCCTCCCCGACTCCCCTTTCCTCAAAATC 


1022 


Db 


1009 


1068 


Qy 


1023 


TTTAAACCCGAGGCTGCCTTCCTGCCCGAGTGGGTGGGCATTAATGCAGACTTGTCTCCA 

1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 ) 1 1 1 1 1 r II III 1 

■■Ill II 1 1 1 1 1 1 1 II ■ ■ ■ ■ 1 1 1 1 1 II II II III 1 1 1 1 1 1 1 1 1 1 II 

TTTAAGCCAGAGGCAGCCTTCCTGCCGGAGTGGACAGGGATTGCCGCAGACCTGTCCCCC 


1082 


Db 


1069 


1128 


Qy 


1083 


CTCCAGATG 1091 




Db 


1129 


1 1 1 1 
TTACTCAAG 1137 





RESULT 15 
US-10-756-149-275 

; Sequence 275, Application US/10756149 

; Publication No. US20050181375A1 

; GENERAL INFORMATION: 

; APPLICANT: Aziz, Natasha 

; APPLICANT: Zlotnik, Albert 

; TITLE OF INVENTION: NOVEL METHODS OF DIAGNOSIS OF METASTATIC CANCER, COMPOSITIONS A 
; TITLE OF INVENTION: METHODS OF SCREENING FOR MODULATORS OF METASTATIC CANCER 
; FILE REFERENCE: file 

; CURRENT APPLICATION NUMBER: US/10/756,149 
; CURRENT FILING DATE: 2004-01-12 
; NUMBER OF SEQ ID NOS : 5818 
; SOFTWARE: PatentIn version 3.2 
; SEQ ID NO 275 
LENGTH: 3088 



http://es/ScoreAccessWeb/Gefltem.action?AppId=09844268&seqId=824814&ItemN^ 11/15/06 



ov^vyxv^ ocoit/ii xvc^duiid rycioiid lui r^ppin^auuii \j:70^-t^\jo oiiu ocaii^ii xvcauii ua-u:7-o-T-r... x a^c ux 



TYPE: DMA 

ORGANISM: Homo Sapiens 
US-10-756-149-275 

Query Match 35.4%; Score 449.4; DB 10; Length 3088; 

Best Local Similarity 71.5%; Pred. No. 6.8e-122; 

Matches 607; Conservative 0; Mismatches 236; Indels 6; Gaps 1; 



Qy 


243 


GGGACCTGGACTATTTACCCGGATGGCCGGTTTGGGAACCAGATGGGACAGTATGCCACG 
INI 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 
GGGATGTGGACGATCAATGCAATAGGCCGCCTGGGGAACCAGATGGGCGAGTACGCCACA 


302 


Db 


295 


354 


Qy 


303 


CTGCTGGCCCTGGCGCAGCTCAACGGCCGCCAGGCCTTCATCCAGCCTGCCATGCACGCC 
1 1 1 1 1 1 i M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 M 1 
CTGTACGCCCTGGCCAAGATGAACGGGCGGCCGGCCTTCATCCCGGCCCAGATGCACAGC 


362 


Db 


355 


414 


Qy 


363 


GTCCTGGCCCCCGTGTTCCGCATCACGCTGCCTGTCCTGGCGCCCGAGGTAGACAGGCAC 

1 1 1 i 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II MM 
ACCCTGGCCCCCATCTTCAGAATCACCCTGCCGGTGCTGCACAGCGCCACGGCCAGCAGG 


422 


Db 


415 


474 


Qy 


423 


GCTCCTTGGCGGGAGCTGGAGCTTCACGACTGGATGTCCGAGGATTATGCCCACTTAAAG 
1 1 1 1 1 1 1 t III II 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 M II II 1 
ATCCCCTGGCAGAACTACCACCTGAACGACTGGATGGAGGAGGAATACCGCCACTTCCCG 


482 


Db 


475 


534 


Qy 


483 


GAGCCCTGGCTGAAGCTCACCGGCTTCCCCTGCTCCTGGACCTTCTTCCACCACCTCCGG 

II 1 1 M 1 1 II 1 1 i II 1 II 11 1 II 1 1 1 M 1 1 1 II 1 II 1 1 1 1 II 1 1 1 

GGGGAGTACGTCCGCTTCACCGGCTACCCCTGCTCCTGGACCTTCTACCACCACCTCCGC 


542 


Db 


535 


594 


Qy 


543 


GAGCAGATCCGCAGCGAGTTCACCCTGCACGACCACCTTCGGCAAGAGGCCCAGGGGGTA 

M 1 1 M 1 1 1 1 M M 1 II 1 1 1 1 1 1 1 1 1 I 1 1 1 1 II 1 1 1 1 1 1 1 1 t 1 1 II 
CAGGAGATCCTCCAGGAGTTCACCCTGCACGACCACGTGCGGGAGGAGGCCCAGAAGTTC 


602 


Db 


595 


654 


Qy 


603 


CTGAGTCAGTTCCGTCTACCCCGCACAGGGGACCGCCCCAGCACCTTCGTGGGGGTCCAC 

1 N 1^ INI 1 ill 1 II II 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 

CTGCG GGGCCTGCAGGTGAACGGGAGCCGGCCGGGCACCTTTGTAGGGGTCCAT 


662 


Db 


655 


708 


Qy 

Db 


663 
709 


GTGCGCCGCGGGGACTATCTGCGTGTGATGCCCAAGCGCTGGAAGGGGGTGGTGGGTGAC 

II 1 1 M i 1 1 1 1 1 1 II Mill II MM. II 1 

GTTCGCCGAGGGGACTATGTCCATGTCATGCCAAAAGTGTGGAAGGGGGTGGTGGCCGAC 


722 
768 


Qy 


723 


GGCCGTTACCTCCAGCAGGCTATGGACTGGTTCCGGGCCCGATACGAAGCCCCCGTCTTT 

1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II II 1 1 1 1 1 II 1 M M 1 M M 1 1 1 1 1 1 

CGGCGATACCTACAGCAGGCCCTGGACTGGTTCCGAGCTCGCTACAGCTCCCTCATCTTC 


782 


Db 


769 


828 


Qy 

Db 


783 
829 


GTGGTCACCAGCAACGGCATGGAGTGGTGCCGGAAGAACATCGACACCTCCCGGGGGGAC 
i 1 M 11 1 1 1 1 1 II M II t II Mill II 1 II 1 II 1 1 1 1 II 1 1 1 1 1 1 II II 
GTGGTCACCAGTAATGGCATGGCCTGGTGTCGGGAGAACATTGACACCTCCCACGGTGAT 


842 
888 


Qy 


843 


GTGATCTTTGCTGGCGATGGGCGGGAGGCCGCGCCCGCCAGGGACTTTGCGCTGCTGGTG 
III 1 1 1 1 1 1 1 1 1 1 1 1 1 t 1 1 1 1 1 1 I II • • ■ 1 11 

1 1 1 1 1 1 1 1 1 1 1 M 1 I 1 1 1 II II 1 1 II 1 1 1 1 II Mill II II 

GTGGTGTTTGCTGGCGATGGCATTGAGGGCTCACCTGCCAAAGATTTTGCTCTACTCACA 


902 


Db 


889 


948 


Qy 

Db 


903 
949 


CAGTGCAACCACACCATCATGACCATTGGCACCTTCGGCTTCTGGGCCGCCTACCTGGCT 

INN Mill MUM II Mill llllllllll lllll 1 

CAGTGTAACCACACCATCATGACCATTGGGACGTTCGGGATCTGGGCCGCATACCTCACG 


962 
1008 


Qy 


963 


GGTGGAGATACCATCTACTTGGCTAACTTCACCCTGCCCACTTCCAGCTTCCTGAAGATC 

II 1 1 1 1 II 1 II 1 III Mill II III 

GGCGGAGACACCATCTACCTGGCCAATTACACCCTCCCCGACTCCCCTTTCCTCAAAATC 


1022 


Db 


1009 


1068 


Qy 


1023 


TTTAAACCCGAGGCTGCCTTCCTGCCCGAGTGGGTGGGCATTAATGCAGACTTGTCTCCA 


1082 
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ocon^ii xvcauiLd rycioiid lui r^^piiv^atujii v:7o^n^uo oiiu ocaivii xvcauii, ui-v>'-on^... x'a^c yjk 



I I I I I M I I I I I I I I I I I I I I I I I I I I I I II III I II I M I I I I II 

Db 1069 TTTAAGCCAGAGGCAGCCTTCCTGCCGGAGTGGACAGGGATTGCCGCAGACCTGTCCCCC 1128 

Qy 1083 CTCCAGATG 1091 

I I I I 
Db 1129 TTACTCAAG 1137 



Search completed: November 14, 2006, 03:17:40 
Job time : 1819 sees 



SCORE 1.3 BuildDate: 12/06/2005 
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^K^\j±\u. ocai\^ii xvcauita i^rciaiia xui T-vp^iioauuii \jy<y-^^^\jo oiiu ocaiL/U ivcauit ua-vj^-on-r-. 



SCORE Search Results Details for Application 
09844268 and Search Result us-09-844-268- 
12.rnpbn. 



Score Home 
Page 



Retrieve Application 
List 



SCORE System 
Overview 



SCORE 
FAQ 



Comments / 
Suggestions 



This page gives you Search Results detail for the Application 09844268 and Search Result us-09- 
844-268-12. rnpbn. 

start 



Go Back to previous page 



GenCore version 5.1.9 
Copyright (c) 1993 - 2006 Biocceleration Ltd. 



CM nucleic - nucleic search, using sw model 
Run on: 



Title: 

Perfect score: 
Sequence : 

Scoring table: 



November 13, 2006, 21:00:21 ; Search time 350 Seconds 

(without alignments) 
7176.665 Million cell updates/sec 

US-09-844-268-12 
1269 

1 ctcgagccatgtgggtcccc actcagaggaaaccctgcag 1269 

IDENTITY_NUC 

Gapop 10.0 , Gapext 1.0 



Searched: 2471159 seqs, 989689746 residues 

Total number of hits satisfying chosen parameters: 

Minimum DB seq length: 0 

Maximum DB seq length: 2000000000 

Post-processing: Minimtim Match 0% 

Maximum Match 100% 
Listing first 45 summaries 



4942318 



Database 



Published_Applications_NA New:* 



/EMC_Celerra_SIDS3/ptodata/2/pubpna/US09_NEW_PUB.seq:* 
/EMC_Celerra_SIDS3/ptodata/2/pubpna/US06_NEW_PUB.seq:* 
/EMC_Celerra_SIDS3/ptodata/2/pubpna/US07_NEW_PUB.seq:* 
/EMC_Celerra_SIDS3/ptodata/2/pubpna/US08_NEW_PUB.seq:* 
/EMC_Celerra_SIDS3/ptodata/2/pubpna/PCT_NEW_PUB.seq:* 
/EMC_Celerra_SIDS3/ptodata/2/pubpna/US10_NEW_PUB.seq:* 
/EMC_Celerra_SIDS3/ptodata/2/pubpna/USll_NEW_PUB.seq:* 
/EMC_Celerra_SIDS3/ptodata/2/pubpna/USll_NEW_PUB. seql : * 
/EMC_Celerra_SIDS3/ptodata/2/pubpna/USll_NEW_PUB. seq2 : * 
/EMC_Celerra_SIDS3/ptodata/2/pubpna/US60_NEW_PUB.seq:* 

Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 



1 
2 
3 
4 
5 
6 
7 
8 
9 
10: 



SUMMARIES 
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Result Query 





JMO . 


Score 


Match 


Length DB 


ID 








Description 




1 


/I /I Q /I 


35 


.4 


3088 


8 


US 


-11 


-266 


-748A-28779 


Q (Tl 1 o n r* *a 






o 
i. 


/I /I O yi 

449.4 


35 


.4 


3088 


8 


US 


-11 


-266 


-748A-56661 








6 


447.8 


35 


.3 


1784 


8 


us 


-11 


-266 


-748A-61104 


SecnjF*nr 

k-* *w IXV^ c 


vJ X X U *3 / r\ 


c 


A 
H 


Q"? 1 O 


29 


.3 


1076 


8 


US 


-11 


-266 


-748A-73885 




f OOO D f J\ 




c 
D 


"3 T 1 O 
O / 1 . Z 


29 


.3 


1076 


8 


us 


-11 


-266 


-748A-126696 






c 


c 
D 


210 


16 


.5 


1000 


8 


us 


-11 


-266 


-748A-210852 




o 1 no c o 
Z J.UO ^ 






101 . 6 


8 


.0 


807 


8 


us- 


-11 


-266 


-748A-369805 




J by 0 Uo , 


c 


o 

O 


lUl . 6 


8 


.0 


807 


8 


us- 


-11 


-266 


-748A-453184 






c 


y 


48.6 


3 


.8 


36535 


6 


US- 


-10- 


-561 


-201-3 




3, Appli 


c 


10 


48.6 


3 


.8 


36604 


6 


US - 


-10 


-561- 


-201-2 




2, Appli 




11 


47.4 


3 


.7 


6573 


9 


US- 


-11- 


-218- 


-305-13034 


O Cs^ Lie 11^ C 


1 "50 "5^ n 
loU J4 , A 


c 


12 


47 


3 


.7 


36462 


6 


US - 


-10- 


-561- 


-201-1 




1 / Appl i 


c 


1 o 


4b . D 


3 


.7 


1158 


9 


US- 


-11- 


-218- 


-305-24825 


O C Li c 1 1 0 


9 ^ p o c TV 
Z 4 O Z O , A 






4 D . D 


3, 


.7 


1196 


9 


US - 


-11- 


-056- 


-355B-63909 




Do 3U y , /\ 


c 


15 


45 


3, 


.5 


1217 


9 


US- 


-11- 


-218- 


-305-24824 


C ^ /-TI 1 a n r" <a 


9 4 P 9 4 n 


c 


1 D 


4 5 


3. 


.5 


3458 


9 


US - 


-11- 


-218- 


-305-24822 


C A /-Tl 1 /^ T-l /-^ /^ 

oequence 


O /I Q O O 7\ 




1 n 
1 / 


4o . o 


3. 


.5 


1382 


9 


US- 


-11- 


-218- 


-305-8602 




pen*? 7\i^ 
o b u ^; , Ap 


c 


18 


43.8 


3, 


.5 


35990 


7 


US - 


-11- 


-177- 


-646-577 


Secjuence 


Oil, App 


c 


1 y 


43.8 


3 . 


. 5 


35994 


7 


US- 


-11- 


-177- 


-646-578 


Sequence 


578, App 




20 


43 . 6 


3. 


A 


1527 


6 


us- 


■10- 


-449- 


-902-7630 


oequence 


/boU, Ap 




21 


43 


3. 


,4 


1799 


9 


us- 


-11- 


-218- 


-305-9772 


Sequence 


y / / 2 , Ap 




22 


43 


3. 


, 4 


2281 


9 


us- 


-11- 


-218- 


-305-5693 


Sequence 


5593, Ap 




23 


42 . 8 


3, 


4 


1175 


6 


us- 


-10- 


-449- 


-902-25325 


oequence 


O C O O C TV 

coSZo, A 




24 


42 . 8 


3. 


4 


1427 


6 


us- 


-10- 


-449- 


-902-12258 


Sequence 


1 O O C O TV 

LZZoo f A 




25 


42 . 6 


3. 


4 


1500 


6 


us- 


-10- 


-449- 


-902-13184 


Sequence 


1 O 1 O /I T\ 

iolo4, A 




£. D 


42 . 4 


3. 


3 


888 


9 


us- 


•11- 


-218- 


-305-18144 


oequence 


1 O 1 /I /I 7\ 

10144 , A 




Z / 


42 . 4 


3. 


3 


1316 


6 


us- 


•10- 


-449- 


•902-322 


oequence 


f App 




28 


42 . 4 


3. 


3 


1411 


6 


us- 


■10- 


-449- 


-902-21269 


oequence 


Zlzby, A 




z y 


42 . 4 


3. 


3 


3000 


6 


us- 


■10- 


-449- 


■902-16305 


Sequence 


1 O A C TV 

IbiOo, A 






42 . 2 


3. 


3 


1411 


9 


us- 


■11- 


-218- 


-305-8608 


oequence 


O £r r\ o TV « 

obUo, Ap 


c 


31 


42 . 2 


3. 


3 


1619 


6 


us- 


■10- 


-449- 


-902-9837 


Sequence 


Q O O -7 TV 

9oJ / , Ap 




32 


42.2 


3. 


3 


2727 


9 


us- 


11- 


-218- 


•305-8606 


Sequence 


Q Cr\ C TV — * 

obUb, Ap 




33 


42 . 2 


3. 


3 


2820 


6 


us- 


10- 


•449- 


•902-25581 


oequence 


O C C Q 1 TV 

ZDDol, A 




34 


42 


3. 


3 


1616 


9 


us- 


11- 


■056- 


■355B-14145 


Sequence 


1 /I 1 /I C TV 

14145, A 




OO 


42 


3. 


3 


1725 


6 


us- 


10- 


■449- 


•902-7753 


oequence 


•7 7 C Q TV — 

tfOOf Ap 




oo 


.41.8 


3. 


3 


1328 


6 


us- 


10- 


449- 


902-9788 


oequerice 


Q-7 p P — 

y / 0 0 , Ap 




6 1 


41 . o 


3. 


3 


2163 


6 


us- 


10- 


•449- 


902-7754 


oequence 


T 7 C /I TV _ 

1 ( OH , Ap 




'3 Q 


41 . o 


3. 


3 


4589 


9 


ys- 


11- 


218- 


305-10133 


Sequence 


1013^ A 






4 1 . b 


3. 


3 


1344 


6 


us- 


10- 


953- 


349-28304 


Sequence 


28304, A 




40 


41 

*1 X . o 


3. 


3 


1344 


9 


TTC 


11- 


rt c 

055- 


355B-68758 


Sequence 


68758, A 




41 


41.6 


3. 


3 


1608 


6 


us- 


10- 


449- 


902-26162 


Sequence 


26162, A 




42 


41.4 


3. 


3 


1548 


6 


us- 


10- 


449- 


902-7534 


Sequence 


7534, Ap 




43 


41.4 


3. 


3 


1717 


9 


us- 


11- 


218- 


305-8216 


Sequence 


8216, Ap 




44 


41.4 


3. 


3 


1926 


9 


us- 


11- 


218- 


305-4011 


Sequence 


4011, Ap 




45 


41.2 


3. 


2 


433 


8 


us- 


11- 


266- 


748A-12462 


Sequence 


12462, A 



ALIGNMENTS 



RESULT 1 

US-11-266-748A-28779 

; Sequence 28779, Application US/11266748A 

; Publication No. US20060134663A1 

; GENERAL INFORMATION: 

; APPLICANT: Harkin, Paul 

; APPLICANT; Johnston, Patrick 
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APPLICANT: Mulligan, Karl 

TITLE OF INVENTION: Transcriptome Microarray Technology and 
TITLE OF INVENTION: Methods of Using the Same 
FILE REFERENCE: 55815-0102 (319189) 
CURRENT APPLICATION NUMBER: US/11/2 66, 748A 
CURRENT FILING DATE: 2005-11-03 
PRIOR APPLICATION NUMBER: EP 04105479.2 
PRIOR FILING DATE: 2004-11-03 
PRIOR APPLICATION NUMBER: EP 04105482.6 
PRIOR FILING DATE: 2004-11-03 
PRIOR APPLICATION NUMBER: EP 04105483.4 
PRIOR FILING DATE: 2004-11-03 
PRIOR APPLICATION NUMBER: EP 04105507.0 
PRIOR FILING DATE: 2004-11-03 
PRIOR APPLICATION NUMBER: EP 04105485.9 
PRIOR FILING DATE: 2004-11-03 
PRIOR APPLICATION NUMBER: EP 04105484.2 
PRIOR FILING DATE: 2004-11-03 
PRIOR APPLICATION NUMBER: US 60/662,276 
PRIOR FILING DATE: 2005-03-14 
PRIOR APPLICATION NUMBER: US 60/700,293 
PRIOR FILING DATE: 2005-07-18 
NUMBER OF SEQ ID NOS : 483996 
SOFTWARE: Patentin version 3.3 
SEQ ID NO 28779 
LENGTH: 3088 
TYPE: DNA 

ORGANISM: Homo Sapiens 
US-11-266-748A-28779 

Query Match 35.4%; Score 449.4; DB 8; Length 3088; 

Best Local Similarity 71.5%; Pred. No. 3.5e-101; 

Matches 607; Conservative 0; Mismatches 236; Indels 6; Gaps 1; 



Qy 


243 


GGGACCTGGACTATTTACCCGGATGGCCGGTTTGGGAACCAGATGGGACAGTATGCCACG 

Nil 1 II 1 1 t 1 1 1 1 1 1 1 1 1 II 1 1 II M 1 1 1 M 1 1 |.M 

GGGATGTGGACGATCAATGCAATAGGCCGCCTGGGGAACCAGATGGGCGAGTACGCCACA 


302 


Db 


295 


354 


Qy 


303 


CTGCTGGCCCTGGCGCAGCTCAACGGCCGCCAGGCCTTCATCCAGCCTGCCATGCACGCC 
Ml 1 1 1 1 II II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 j 1 1 1 1 1 1 

CTGTACGCCCTGGCCAAGATGAACGGGCGGCCCGCCTTCATCCCGGCCCAGATGCACAGC 


362 


Db 


355 


414 


Qy 


363 


GTCCTGGCCCCCGTGTTCCGCATCACGCTGCCTGTCCTGGCGCCCGAGGTAGACAGGCAC 

1 1 1 1 II 1 M 1 1 1 1 1 1 1 1 1 1 1 1 II II 1 1 1 1 t II 1 1 1 1 

ACCCTGGCCCCCATCTTCAGAATCACCCTGCCGGTGCTGCACAGCGCCACGGCCAGCAGG 


422 


Db 


415 


474 


Qy 


423 


GCTCCTTGGCGGGAGCTGGAGCTTCACGACTGGATGTCCGAGGATTATGCCCACTTAAAG 

1 1 1 1 1 1 1 1 III II 1 1 1 1 1 II II nil Ill 1 

ATCCCCTGGCAGAACTACCACCTGAACGACTGGATGGAGGAGGAATACCGCCACTTCCCG 


482 


Db 


475 


534 


Qy 


483 


GAGCCCTGGCTGAAGCTCACCGGCTTCCCCTGCTCCTGGACCTTCTTCCACCACCTCCGG 
II 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 II 1 1 1 1 1 1 1 1 M I 1 I 1 1 
GGGGAGTACGTCCGCTTCACCGGCTACCCCTGCTCCTGGACCTTCTACCACCACCTCCGC 


542 


Db 


535 


594 


Qy 


543 


GAGCAGATCCGCAGCGAGTTCACCCTGCACGACCACCTTCGGCAAGAGGCCCAGGGGGTA 

1 1 1 1 1 t U 1 1 II II 1 1 II II M 1 1 1 1 M II II 

CAGGAGATCCTCCAGGAGTTCACCCTGCACGACCACGTGCGGGAGGAGGCCCAGAAGTTC 


602 


Db 


595 


654 


Qy 


603 


CTGAGTCAGTTCCGTCTACCCCGCACAGGGGACCGCCCCAGCACCTTCGTGGGGGTCCAC 

1 1 1 1 1 1 1 1 1 Ml 1 1 1 1 1 1 1 II M 1 1 1 II M 1 II 1 

CTGCG GGGCCTGCAGGTGAACGGGAGCCGGCCGGGCACCTTTGTAGGGGTCCAT 


662 


Db 


655 


708 
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ov^v/xvij, ocaiL.li Avcauiia i^ctaiia lui r^y^a^^aUKJii vy:7o-rnz.\Ju oiiu ocaiL.ii ivcduit ua-u^- 



Qy 


663 


GTGCGCCGCGGGGACTATCTGCGTGTGATGCCCAAGCGCTGGAAGGGGGTGGTGGGTGAC 

II Hill ilMIIIII 1 1 III Mill 1 Illlllllllll III 

GT T CGC C GAGGGGAC TATG TC C AT GTC ATGP r a a n n aTr T r r n n r* r r- r r"m r- r-/™ r*/- 7\ ^ 


722 


Db 


709 


768 


Qy 


723 


GGCCGTTACCTCCAGCAGGCTATGGACTGGTTCCGGGCCCGATACGAAGCCCCCGTCTTT 
1 II INN 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II II III III 1 II II 

C GGCGAT AC C T AC AGC AGGC C C T GGAC T GGT T r r aa Tr r::r T a r n r r Tr n n rnr- 7\ m m mr* 


782 


Db 


769 


828 


Qy 


783 


GTGGTCACCAGCAACGGCATGGAGTGGTGCCGGAAGAACATCGACACCTCCCGGGGGGAC 

IINII 1 lllllll Mill III lllllll llllllllll II II 

GT GGT CAC C AG TAAT GGC ATGGC C T GGT GT r f^naar: a a r a T TP n r n r r- r- n 7\ m 7\ m 


842 


Db 


829 


888 


Qy 


843 


GTGATCTTTGCTGGCGATGGGCGGGAGGCCGCGCCCGCCAGGGACTTTGCGCTGCTGGTG 
Ml 1 llllllllllllll Illlllllllll II Mill II II 

GTGGTGT T T GC TGGC GATGGC aT T Gaf^f;r TP ar r Tr rra n n rn "pn r" m n m 7\ ^ 7\ 


902 


Db 


889 


948 


Qy 


903 


CAGTGCAACCACACCATCATGACCATTGGCACCTTCGGCTTCTGGGCCGCCTACCTGGCT 
1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M II 1 1 II Mill llllllllll II 1 II 1 

^■"^ ^ .™-i.v., \^T-L\^n\^\^^x v-rt 1 X X (jvjtr/\u o- X X u otjtjA X u i (jtjrtjL. C GC AT AC C T C AC G 


962 


Db 


949 


1008 


Qy 


963 


GGTGGAGATACCATCTACTTGGCTAACTTCACCCTGCCCACTTCCAGCTTCCTGAAGATC 

N 1 i 1 M 1! 1 II II M II II M 1 1 II 1 M III III II M 1 II III 

\jvj\-.\j\jrt\jrt^rt^^rti X t7^TV^L*/AM.x Xi\L*/\^^uu iCL/L-CCjAC TCCCCTTTCCTCAAAATC 


1022 


Db 


1009 


1068 






i i 1 AAACCCGAGGCTGCCTTCCTGCCCGAGTGGGTGGGCATTAATGCAGACTTGTCTCCA 
1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 II 1 1 II i 1 1 II 1 II III. 1 M 1 II 1 1 1 1 II 
TTTAAGCCAGAGGCAGCCTTCCTGCCGGAGTGGACAGGGATTGCCGCAGACCTGTCCCCC 


1082 


Db 


1069 


1128 


Qy 


1083 


CTCCAGATG 1091 




Db 


1129 


1 1 1 1 
TTACTCAAG 1137 





RESULT 2 

US-11-266-748A-56661 

; Sequence 56661, Application US/11266748A 

; Publication No. US20060134663A1 

; GENERAL INFORMATION: 

; APPLICANT: Harkin, Paul 

; APPLICANT: Johnston, Patrick 

; APPLICANT: Mulligan, Karl 

; TITLE OF INVENTION: Transcriptome Microarray Technology and 

; TITLE OF INVENTION: Methods of Using the Same 

; FILE REFERENCE: 55815-0102 (319189) 

; CURRENT APPLICATION NUMBER: US/11/2 66, 748A 

; CURRENT FILING DATE: 2005-11-03 

; PRIOR APPLICATION NUMBER: EP 04105479.2 

; PRIOR FILING DATE: 2004-11-03 

; PRIOR APPLICATION NUMBER: EP 04105482.6 

; PRIOR FILING DATE: 2004-11-03 

; PRIOR APPLICATION NUMBER: EP 04105483.4 

; PRIOR FILING DATE: 2004-11-03 

; PRIOR APPLICATION NUMBER: EP 04105507.0 

; PRIOR FILING DATE: 2004-11-03 

; PRIOR APPLICATION NUMBER: EP 04105485.9 

PRIOR FILING DATE: 2004-11-03 

; PRIOR APPLICATION NUMBER: EP 04105484.2 

; PRIOR FILING DATE: 2004-11-03 

; PRIOR APPLICATION NUMBER: US 60/662,27 6 

; PRIOR FILING DATE: 2005-03-14 
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ov^v/-ivi-> ocaxi^ii xvcauiLd j^c^iaiid rvppiii^auuii \jyo^^^\jo aiiu ocaii^ii avcsuil ua-v/j?-on^-z,... ra^c 



PRIOR APPLICATION NUMBER: US 60/700,293 
PRIOR FILING DATE: 2005-07-18 
NUMBER OF SEQ ID NOS : 483996 
SOFTWARE: Patentin version 3.3 
SEQ ID NO 5 6661 
LENGTH: 3088 
TYPE: DNA 

ORGANISM: Homo Sapiens 
US-11-266-748A-56661 

Query Match 35.4%; Score 449.4; DB 8; Length 3088; 

Best Local Similarity 71.5%; Pred. No. 3.5e-101; 

Matches 607; Conservative 0; Mismatches 236; Indels 6; Gaps 1; 

GGGACCTGGACTATTTACCCGGATGGCCGGTTTGGGAACCAGATGGGACAGTATGCCACG 302 

I I I I I I M I I I I I Mill I II I II I I II II I II I I I I I I I II 
GGGATGTGGACGATCAATGCAATAGGCCGCCTGGGGAACCAGATGGGCGAGTACGCCACA 354 

CTGCTGGCCCTGGCGCAGCTCAACGGCCGCCAGGCCTTCATCCAGCCTGCCATGCACGCC 362 
III I II I I I I I I I I I I I II I I I I I I II I I I I I I I I I I I II I 

CTGTACGCCCTGGCCAAGATGAACGGGCGGCCCGCCTTCATCCCGGCCCAGATGCACAGC 414 

GTCCTGGCCCCCGTGTTCCGCATCACGCTGCCTGTCCTGGCGCCCGAGGTAGACAGGCAC 422 

I I II II I I I I I I I I I I I I I I I I I I I I I I I I II I I I I 

ACCCTGGCCCCCATCTTCAGAATCACCCTGCCGGTGCTGCACAGCGCCACGGCCAGCAGG 474 

GCTCCTTGGCGGGAGCTGGAGCTTCACGACTGGATGTCCGAGGATTATGCCCACTTAAAG 482 

I I I I I I I I III I I I I I I I I I I I I I I I I I I I II I I I I 

ATCCCCTGGCAGAACTACCACCTGAACGACTGGATGGAGGAGGAATACCGCCACTTCCCG 534 

GAGCCCTGGCTGAAGCTCACCGGCTTCCCCTGCTCCTGGACCTTCTTCCACCACCTCCGG 542 

II I I I I M I I I I I I I I I I II I I I I II I I I I I I I I I I I I I I I I I I I 
GGGGAGTACGTCCGCTTCACCGGCTACCCCTGCTCCTGGACCTTCTACCACCACCTCCGC 594 

GAGCAGATCCGCAGCGAGTTCACCCTGCACGACCACCTTCGGCAAGAGGCCCAGGGGGTA 602 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I II I I I I II 
CAGGAGATCCTCCAGGAGTTCACCCTGCACGACCACGTGCGGGAGGAGGCCCAGAAGTTC 654 

CTGAGTCAGTTCCGTCTACCCCGCACAGGGGACCGCCCCAGCACCTTCGTGGGGGTCCAC 662 
I I I I I I I I I III I I I II I I I II I I I I I I I I I I I I 

CTGCG GGGCCTGCAGGTGAACGGGAGCCGGCCGGGCACCTTTGTAGGGGTCCAT 708 

GTGCGCCGCGGGGACTATCTGCGTGTGATGCCCAAGCGCTGGAAGGGGGTGGTGGGTGAC 722 
I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I III 

GTTCGCCGAGGGGACTATGTCCATGTCATGCCAAAAGTGTGGAAGGGGGTGGTGGCCGAC 768 

GGCCGTTACCTCCAGCAGGCTATGGACTGGTTCCGGGCCCGATACGAAGCCCCCGTCTTT 782 

I I I I I I I I I I I 11 I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

CGGCGATACCTACAGCAGGCCCTGGACTGGTTCCGAGCTCGCTACAGCTCCCTCATCTTC 828 

GTGGTCACCAGCAACGGCATGGAGTGGTGCCGGAAGAACATCGACACCTCCCGGGGGGAC 842 
I I M I I I I I I I II I I I I I I I I I I I I III I I I I I I I I I I I II I I I I II II 
GTGGTCACCAGTAATGGCATGGCCTGGTGTCGGGAGAACATTGACACCTCCCACGGTGAT 888 

GTGATCTTTGCTGGCGATGGGCGGGAGGCCGCGCCCGCCAGGGACTTTGCGCTGCTGGTG 902 

INI Ill I I I I I I I I I I I I I I I I I 

GTGGTGTTTGCTGGCGATGGCATTGAGGGCTCACCTGCCAAAGATTTTGCTCTACTCACA 948 

CAGTGCAACCACACCATCATGACCATTGGCACCTTCGGCTTCTGGGCCGCCTACCTGGCT 962 
Mill I II II II I I I II I II II I I I II I M Mill I I 11 I II I I I I II II I 
CAGTGTAACCACACCATCATGACCATTGGGACGTTCGGGATCTGGGCCGCATACCTCACG 1008 



Qy 


243 


Db 


295 


Qy 


303 


Db 


355 


Qy 


363 


Db 


415 


Qy 


423 


Db 


475 


Qy 


483 


Db 


535 


Qy 


543 


Db 


595 


Qy 


603 


Db 


655 


Qy 


663 


Db 


709 


Qy 


723 


Db 


769 


Qy 


783 


Db 


829 


Qy 


843 


Db 


889 


Qy 


903 


Db 


949 
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Ov 


963 


Db 


1009 


Ov 


1023 


Db 


1069 


Qy 


1083 


Db 


1129 



ov^vyxvLi ocoiv/ii xvcauiia j^cuuia lui rv^pm^auuii u^o-t-t^uo oiiu ocaiv^ii ±vc»uit ua-Uv'-o-tn-^... rage u 

GGTGGAGATACCATCTACTTGGCTAACTTCACCCTGCCCACTTCCAGCTTCCTGAAGATC 1022 
II I I I I I II I II I I I I I I I I II I I I II I I III III I I I I I II III 
GGCGGAGACACCATCTACCTGGCCAATTACACCCTCCCCGACTCCCCTTTCCTCAAAATC 1068 

TTTAAACCCGAGGCTGCCTTCCTGCCCGAGTGGGTGGGCATTAATGCAGACTTGTCTCCA 1082 

INN II Mill I I I II I I I I M I II I I I II III I I I I II MM M 

TTTAAGCCAGAGGCAGCCTTCCTGCCGGAGTGGACAGGGATTGCCGCAGACCTGTCCCCC 1128 

CTCCAGATG 1091 
I I I I 



RESULT 3 

US-11-2 66-748A-61104 

Sequence 61104, Application US/11266748A 
Publication No. US20060134663A1 
GENERAL INFORMATION: 
APPLICANT: Harkin, Paul 
APPLICANT: Johnston, Patrick 
APPLICANT: Mulligan, Karl 

TITLE OF INVENTION: Transcriptome Microarray Technology and 
TITLE OF INVENTION: Methods of Using the Same 
FILE REFERENCE: 55815-0102 (319189) 
CURRENT APPLICATION NUMBER: US/11/2 66, 7 48A 
CURRENT FILING DATE: 2005-11-03 
PRIOR APPLICATION NUMBER: EP 04105479.2 
PRIOR FILING DATE: 2004-11-03 
PRIOR APPLICATION NUMBER: EP 04105482.6 
PRIOR FILING DATE: 2004-11-03 
PRIOR APPLICATION NUMBER: EP 04105483.4 
PRIOR FILING DATE: 2004-11-03 
PRIOR APPLICATION NUMBER: EP 04105507.0 
PRIOR FILING DATE: 2004-11-03 
PRIOR APPLICATION NUMBER: EP 04105485.9 
PRIOR FILING DATE: 2004-11-03 
PRIOR APPLICATION NUMBER: EP 04105484.2 
PRIOR FILING DATE: 2004-11-03 
PRIOR APPLICATION NUMBER: US 60/662,276 
PRIOR FILING DATE: 2005-03-14 
PRIOR APPLICATION NUMBER: US 60/700,293 
PRIOR FILING DATE: 2005-07-18 
NUMBER OF SEQ ID NOS : 483996 
SOFTWARE: Patentin version 3.3 
SEQ ID NO 61104 
LENGTH: 1784 
TYPE: DNA 

ORGANISM: Homo Sapiens 
US-11-266-748A-61104 

Query Match 35.3%; Score 447.8; DB 8; Length 1784; 

Best Local Similarity 71.4%; Pred. No. 7.1e-101; 

Matches 606; Conservative 0; Mismatches 237; Indels 6; Gaps 1; 
Qy 243 GGGACCTGGACTATTTACCCGGATGGCCGGTTTGGGAACCAGATGGGACAGTATGCCACG 302 

INI MM I IIIIMM MM 

Db 245 GGGATGTGGACGATCAATGCGATAGGCCGCCTGGGGAACCAGATGGGCGAGTACGCCACA 304 

Qy 303 CTGCTGGCCCTGGCGCAGCTCAACGGCCGCCAGGCCTTCATCCAGCCTGCCATGCACGCC 362 

III IIIIMM II I Mill 11 I IMMIM I 
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Db 


305 


CTGTATGCCCTGGCCAAGATGAACGGGCGGCCCGCCTTCATCCCGGCCCAGATGCACAGC 


364 


Qy 


363 


GTCCTGGCCCCCGTGTTCCGCATCACGCTGCCTGTCCTGGCGCCCGAGGTAGACAGGCAC 

1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1111 
ACCCTGGCCCCCATCTTCAGAATCACCCTGCCGGTGCTGCACAGCGCCACGGCCAGCAGG 


422 


Db 


365 


424 


Qy 


423 


GCTCCTTGGCGGGAGCTGGAGCTTCACGACTGGATGTCCGAGGATTATGCCCACTTAAAG 
1 1 II 1 1 1 1 1 II 1 1 1 1 1 M M 1 1 1 II 1 1 II Mill 1 
ATCCCCTGGCAGAACTACCACCTGAACGACTGGATGGAGGAGGAATACCGCCACATCCCG 


482 


Db 


425 


484 


Qy 


483 


GAGCCCTGGCTGAAGCTCACCGGCTTCCCCTGCTCCTGGACCTTCTTCCACCACCTCCGG 

II 1 1 II II 1 1 1 1 1 M 1 1 1 M 1 M 1. 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 

GGGGAGTACGTCCGCTTCACCGGCTACCCCTGCTCCTAGACCTTCTACCACCACCTCCGC 


542 


Db 


485 


544 


Qy 

Db 


543 
545 


GAGCAGATCCGCAGCGAGTTCACCCTGCACGACCACCTTCGGCAAGAGGCCCAGGGGGTA 

1 1 1 1 1 1 M 1 1 1 1 1 1 1 M 1 1 M M 1 1 1 1 1 1 1 1 II 1 1 1 1 1 II 1 1 1 1 II 

CAGGAGATCCTCCAGGAGTTCACCCTGCACGACCACGTGCGGGAGGAGGCCCAGAAGTTC 


602 
604 


Qy 


603 


CTGAGTCAGTTCCGTCTACCCCGCACAGGGGACCGCCCCAGCACCTTCGTGGGGGTCCAC 

INI 1 1 1 1 1 1 1 1 Mill II M 1 M II M M 1 M 1 

CTGCG GGGCCTGCAGGTGAACGGGAGCCGGCCGGGCACCTTTGTAGGGGTCCAT 


662 


Db 


605 


658 


Qy 


663 


GTGCGCCGCGGGGACTATCTGCGTGTGATGCCCAAGCGCTGGAAGGGGGTGGTGGGTGAC 

M 1 II 1 1 1 1 II II II 1 1 1 1 M 1 II II 1 1 1 1 II 1 M II 1 M 1 II 1 III 

GTTCGCCGAGGGGACTATGTCCATGTCATGCCAAAAGTGTGGAAGGGGGTGGTGGCCGAC 


722 


Db 


659 


718 


Qy 

Db 


723 
719 


GGCCGTTACCTCCAGCAGGCTATGGACTGGTTCCGGGCCCGATACGAAGCCCCCGTCTTT 

1 1 1 1 II 1 1 M M II 1 1 1 1 II II II M 1 M 1 1 M M 1 II 1 1 1 M 1 

CGGCGATACCTACAGCAGGCCCTGGACTGGTTCCGAGCTCGCTACAGCTCCCTCATCTTC 


782 
778 


Qy 


783 


GTGGTCACCAGCAACGGCATGGAGTGGTGCCGGAAGAACATCGACACCTCCCGGGGGGAC 

1 1 1 1 1 1 1 1 M 1 t 1 II II 1 M M II 1 II 1 1 1 1 II 1 1 1 1 M II 1 II 1 III 

GTGGTCACCAGTAATGGCATGGCCTGGTGTCGGGAGAACATTGACACCTCCCACAGTGAT 


842 


Db 


779 


838 


Qy 


843 


GTGATCTTTGCTGGCGATGGGCGGGAGGCCGCGCCCGCCAGGGACTTTGCGCTGCTGGTG 
1 1 1 1 II 1 1 1 1 1 II II M 1 II M 1 1 II 1 II 1 II II M 1 II 1 1 
GTGGTGTTTGCTGGCGATGGCATTGAGGGCTCACCTGCCAAAGATTTTGCTCTACTCACA 


902 


Db 


839 


898 


Qy 

Db 


903 
899 


CAGTGCAACCACACCATCATGACCATTGGCACCTTCGGCTTCTGGGCCGCCTACCTGGCT 
1 1 1 1 1 1 1 M 1 M 1 1 II II 1 1 1 II II II 1 II II 1 II M II M II 1 I 1 1 II 1 1 
CAGTGTAACCACACCATCATGACCATTGGGACGTTCGGGATCTGGGCCGCATACCTCACG 


962 
958 


Qy 

Db 


963 
959 


GGTGGAGATACCATCTACTTGGCTAACTTCACCCTGCCCACTTCCAGCTTCCTGAAGATC 
II Mill II II II M 1 II 1 1 II 1 Mini III III Mill II III 
GGCGGAGACACCATCTACCTGGCC7UVTTACACCCTCCCCGACTCCCCTTTCCTCAAAATC 


1022 
1018 


Qy 

Db 


1023 
1019 


TTTAAACCCGAGGCTGCCTTCCTGCCCGAGTGGGTGGGCATTAATGCAGACTTGTCTCCA 

1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 t 1 1 1 1 t 1 1 III III 1 1 1 1 t 1 1 1 1 1 II 
1 ■ 1 1 1 t 1 1 M 1 i M 1 1 1 M 1 M 1 1 M M 1 Ml III 1 II 1 II 1 II 1 II 

TTTAAGCCAGAGGCAGCCTTCCTGCCGGAGTGGACGGGGATTGCCGCAGACCTGTCCCCC 


1082 
1078 


Qy 


1083 


CTCCAGATG 1091 




Db 


1079 


1 1 1 1 
TTACTCAAG 1087 





RESULT 4 

US-11-266-748A-73885/C 

; Sequence 73885, Application US/11266748A 

; Publication No. US20060134663A1 

; GENERAL INFORMATION: 

; APPLICANT: Harkin, Paul 
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OK^yjL^ ocaiv.li xvcauiLS x^cioiia r^ppin^auuu yjyo-t-tjL^o aiiu ocan^u ivcauii ua-u^^-o-t-t-^... ra^c o ui 



; APPLICANT: Johnston, Patrick 
; APPLICANT: Mulligan, Karl 

; TITLE OF INVENTION: Transcriptome Microarray Technology and 

; TITLE OF INVENTION: Methods of Using the Same 

; FILE REFERENCE: 55815-0102 (319189) 

; CURRENT APPLICATION NUMBER: US/11/2 66, 748A 

; CURRENT FILING DATE: 2005-11-03 

; PRIOR APPLICATION NUMBER: EP 04105479.2 

; PRIOR FILING DATE: 2004-11-03 

; PRIOR APPLICATION NUMBER: EP 04105482.6 

; PRIOR FILING DATE: 2004-11-03 

; PRIOR APPLICATION NUMBER: EP 04105483.4 

; PRIOR FILING DATE: 2004-11-03 

; PRIOR APPLICATION NUMBER: EP 04105507.0 

; PRIOR FILING DATE: 2004-11-03 

; PRIOR APPLICATION NUMBER: EP 04105485.9 

; PRIOR FILING DATE: 2004-11-03 

PRIOR APPLICATION NUMBER: EP 04105484.2 
; PRIOR FILING DATE: 2004-11-03 
; PRIOR APPLICATION NUMBER: US 60/662,276 
; PRIOR FILING DATE: 2005-03-14 
; PRIOR APPLICATION NUMBER: US 60/700,293 
; PRIOR FILING DATE: 2005-07-18 
; NUMBER OF SEQ ID NOS : 483996 
; SOFTWARE: Patentin version 3.3 
; SEQ ID NO 73885 

LENGTH: 1076 

TYPE: DNA 

ORGANISM: Homo Sapiens 
US-11-266-748A-73885 . 

Query Match 29.3%; Score 371.2; DB 8; Length 1076; 

Best Local Similarity 70.5%; Pred. No. 5.2e-82; 

Matches 512; Conservative 0; Mismatches 208; Indels 6; Gaps 1; 



Qy 243 GGGACCTGGACTATTTACCCGGATGGCCGGTTTGGGAACCAGATGGGACAGTATGCCACG 302 

I I I I I I I I I M I t I I I I I I I I I I I I I I I I I I I M I I I I I I I I I 

Db 782 GGGATGTGGACGATCAATGCGATAGGCCGCCTGGGGAACCAGATGGGCGAGTACGCCACA 723 

Qy 303 CTGCTGGCCCTGGCGCAGCTCAACGGCCGCCAGGCCTTCATCCAGCCTGCCATGCACGCC 362 

III I I I I I II t I I I I I I I I I I I I I I I I I I I I I I I MINI I 

Db 722 CTGTATGCCCTGGCCAAGATGAACGGGCGGCCCGCCTTCATCCCGGCCCAGATGCACAGC 663 

Qy 363 GTCCTGGCCCCCGTGTTCCGCATCACGCTGCCTGTCCTGGCGCCCGAGGTAGACAGGCAC 422 

I 11 I I I I I I I I I I I I I I I II I I I I I I I I I I II MM 

Db 662 ACCCTGGCCCCCATCTTCAGAATCACCCTGCCGGTGCTGCACAGCGCCACGGCCAGCAGG 603 

Qy 423 GCTCCTTGGCGGGAGCTGGAGCTTCACGACTGGATGTCCGAGGATTATGCCCACTTAAAG 482 

I I I I I I I I III I I I I I I I I I M I I I I I I I I I I I I I 

Db 602 ATCCCCTGGCAGAACTACCACCTGAACGACTGGATGGAGGAGGAATACCGCCACATCCCG 543 

Qy 483 GAGCCCTGGCTGAAGCTCACCGGCTTCCCCTGCTCCTGGACCTTCTTCCACCACCTCCGG 542 

II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I 

Db 542 GGGGAGTACGTCCGCTTCACCGGCTACCCCTGCTCCTAGACCTTCTACCACCACCTCCGC 483 

Qy 543 GAGCAGATCCGCAGCGAGTTCACCCTGCACGACCACCTTCGGCAAGAGGCCCAGGGGGTA 602 

I I I I I i 11 I I II I I I I I I I II I I I I I I I I I I I II I II I II I M I II 
Db 482 CAGGAGATCCTCCAGGAGTTCACCCTGCACGACCACGTGCGGGAGGAGGCCCAGAAGTTC 423 

Qy 603 CTGAGTCAGTTCCGTCTACCCCGCACAGGGGACCGCCCCAGCACCTTCGTGGGGGTCCAC 662 

I I I I I I I I I III I I I I I I I I II II I I I I I I I I I I 
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o^v/x\j-> oc^oii^a xvcauiia x^c;uuid lui -rvppin^aiiuii \jyK>^^^\ja oiiu ocaii^ii ivcauii ua-u:7-onn-z,. 



Db 


422 


CTGCGGG GCCTGCAGGTGAACGGGAGCCGGCCGGGCACCTTTGTAGGGGTCCAT 


369 


Qy 


663 


GTGCGCCGCGGGGACTATCTGCGTGTGATGCCCAAGCGCTGGAAGGGGGTGGTGGGTGAC 

II IMII IIIIIMII 1 1 III Mill II IIIIMII II 

GTTCGCCGAGGGGACTATGTCCATGTCATGCCAAAAGTGTGGAAGGGGGTGGTGGCCGAC 


722 


Db 


368 


309 


Qy 

Db 


723 
308 


GGCCGTTACCTCCAGCAGGCTATGGACTGGTTCCGGGCCCGATACGAAGCCCCCGTCTTT 

Ml MM IIIIIMII IIIIMII 

CGGCGATACCTACAGCAGGCCCTGGACTGGTTCCGAGCTCGCTACAGCTCCCTCATCTTC 


782 
249 


Qy 


783 


GTGGTCACCAGCAACGGCATGGAGTGGTGCCGGAAGAACATCGACACCTCCCGGGGGGAC 
M 1 1 1 II II II 1 1 1 1 II 1 II 1 II 1 1 II 1 1 1 1 II 1 1 1 1 1 1 II 1 II 1 MM 
GTGGTCACCAGTAATGGCATGGCCTGGTGTCGGGAGAACATTGACACCTCCCACAGTGAT 


842 


Db 


248 


189 


Qy 


843 


GTGATCTTTGCTGGCGATGGGCGGGAGGCCGCGCCCGCCAGGGACTTTGCGCTGCTGGTG 

Ml 1 Mill MM II IIIIIMII MM II 

GTGGTGTTTGCTGGCGATGGCATTGAGGGCTCACCTGCCAAAGATTTTGCTCTACTCACA 


902 


Db 


188 


129 


Qy 


903 


CAGTGCAACCACACCATCATGACCATTGGCACCTTCGGCTTCTGGGCCGCCTACCTGGCT 

Mill 1 II 1 1 1 1 II 1 1 1 1 II 1 1 II II 1 1 II II M II II II 1 M 1 II II 1 

CAGTGTAACCACACCATCATGACCATTGGGACGTTGGGGATCTGGGCCGCATACCCCTCG 


962 


Db 


128 


69 


Qy 


963 


GGTGGA 968 

1 1 1 1 
TGTTGA 63 




Db 


68 





RESULT 5 

US-11-266-748A-126696 

; Sequence 126696, Application US/11266748A 

; Piiblication No. US20060134663A1 

; GENERAL INFORMATION: 

; APPLICANT: Harkin, Paul 

; APPLICANT: Johnston, Patrick 

; APPLICANT: Mulligan, Karl 

; TITLE OF INVENTION: Transcriptome Microarray Technology and 

; TITLE OF INVENTION: Methods of Using the Same 

; FILE REFERENCE: 55815-0102 (319189) 

; CURRENT APPLICATION NUMBER: US/11/2 66, 7 48A 

; CURRENT FILING DATE: 2005-11-03 

PRIOR APPLICATION NUMBER: EP 04105479.2 
; PRIOR FILING DATE: 2004-11-03 
; PRIOR APPLICATION NUMBER: EP 04105482.6 
; PRIOR FILING DATE: 2004-11-03 
; PRIOR APPLICATION NUMBER: EP 04105483.4 
; PRIOR FILING DATE: 2004-11-03 

PRIOR APPLICATION NUMBER: EP 04105507.0 
; PRIOR FILING DATE: 2004-11-03 
; PRIOR APPLICATION NUMBER: EP 04105485.9 
; PRIOR FILING DATE: 2004-11-03 
; PRIOR APPLICATION NUMBER: EP 04105484.2 
; PRIOR FILING DATE: 2004-11-03 
; PRIOR APPLICATION NUMBER: US 60/662,276 
; PRIOR FILING DATE: 2005-03-14 
; PRIOR APPLICATION NUMBER: US 60/700,293 
; PRIOR FILING DATE: 2005-07-18 
; NUMBER OF SEQ ID NOS : 483996 
; SOFTWARE: Patentin version 3.3 
; SEQ ID NO 126696 
LENGTH: 1076 
TYPE : DNA 
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ov>wi>aj. ocoiv/ii Avcauita x^cuuia lkji r%.yyn\^au\jii v^^o^n^uo oiiu ocaiuii xvcauit ua-u:7-o^^... rage lu *ji 

ORGANISM: Homo Sapiens 
US-11-266-748A-126696 

Query Match 29.3%; Score 371.2; DB 8; Length 1076; 

Best Local Similarity 70.5%; Pred. No. 5.2e-82; 

Matches 512; Conservative 0; Mismatches 208; Indels 6; Gaps 1; 

Qy 243 GGGACCTGGACTATTTACCCGGATGGCCGGTTTGGGAACCAGATGGGACAGTATGCCACG 302 

lilt I M I I I I I II I I I I I I I II II I I I II I II I I I I I I I I I I 
Db 295 GGGATGTGGACGATCAATGCGATAGGCCGCCTGGGGAACCAGATGGGCGAGTACGCCACA 354 

Qy 303 CTGCTGGCCCTGGCGCAGCTCAACGGCCGCCAGGCCTTCATCCAGCCTGCCATGCACGCC 362 

IN I I I I I I I I I I I I I I II I I I I II I I I I II I I I I I I I I I I 

Db 355 CTGTATGCCCTGGCCAAGATGAACGGGCGGCCCGCCTTCATCCCGGCCCAGATGCACAGC 414 

Qy 363 GTCCTGGCCCCCGTGTTCCGCATCACGCTGCCTGTCCTGGCGCCCGAGGTAGACAGGCAC 422 

M M I I I I I I I III I II I II I I I I I II III II I Ml 

. 415 ACCCTGGCCCCCATCTTCAGAATCACCCTGCCGGTGCTGCACAGCGCCACGGCCAGCAGG 4 74 

Qy 423 GCTCCTTGGCGGGAGCTGGAGCTTCACGACTGGATGTCCGAGGATTATGCCCACTTAAAG 482 

I I I M I I I III I I I I I I I II I I I I I I I I I I I I I I I 

Db 475 ATCCCCTGGCAGAACTACCACCTGAACGACTGGATGGAGGAGGAATACCGCCACATCCCG 534 

Qy 483 GAGCCCTGGCTGAAGCTCACCGGCTTCCCCTGCTCCTGGACCTTCTTCCACCACCTCCGG 542 

II I I I I I I I I I I I I II I I M I I I I I I I I M II I I I I I I I I I t I I 

Db 535 GGGGAGTACGTCCGCTTCACCGGCTACCCCTGCTCCTAGACCTTCTACCACCACCTCCGC 594 

Qy 543 GAGCAGATCCGCAGCGAGTTCACCCTGCACGACCACCTTCGGCAAGAGGCCCAGGGGGTA 602 

I I I M I I I I I I I M I I I I I II I I I I I I I I I r I I I I t II I I I I I I II 
Db 595 CAGGAGATCCTCCAGGAGTTCACCCTGCACGACCACGTGCGGGAGGAGGCCCAGAAGTTC 654 

Qy 603 CTGAGTCAGTTCCGTCTACCCCGCACAGGGGACCGCCCCAGCACCTTCGTGGGGGTCCAC 662 

INI INI I III I I I I I I I II I M I I I M I I I I I 

Db 655 CTGCGGG GCCTGCAGGTGAACGGGAGCCGGCCGGGCACCTTTGTAGGGGTCCAT 708 

Qy 663 GTGCGCCGCGGGGACTATCTGCGTGTGATGCCCAAGCGCTGGAAGGGGGTGGTGGGTGAC 722 

N NNI MM! I I I II II II Ill III 

Db 709 GTTCGCCGAGGGGACTATGTCCATGTCATGCCAAAAGTGTGGAAGGGGGTGGTGGCCGAC 768 

Qy 723 GGCCGTTACCTCCAGCAGGCTATGGACTGGTTCCGGGCCCGATACGAAGCCCCCGTCTTT 782 

I II I I II I N I I I I M II I I I II I I I I I I M I I I I I I I I I I I I I 

Db 7 69 CGGCGATACCTACAGCAGGCCCTGGACTGGTTCCGAGCTCGCTACAGCTCCCTCATCTTC 828 

Qy 783 GTGGTCACCAGCAACGGCATGGAGTGGTGCCGGAAGAACATCGACACCTCCCGGGGGGAC 842 

I I I N N N N I I N I II II I N I I I M I I I I II I I I I I I I I II I | | | 

Db 829 GTGGTCACCAGTAATGGCATGGCCTGGTGTCGGGAGAACATTGACACCTCCCACAGTGAT 888 

Qy 843 GTGATCTTTGCTGGCGATGGGCGGGAGGCCGCGCCCGCCAGGGACTTTGCGCTGCTGGTG 902 

IN I IIMIIIIIIIIII I II MM 1! Mill II II 

Db 889 GTGGTGTTTGCTGGCGATGGCATTGAGGGCTCACCTGCCAAAGATTTTGCTCTACTCACA 948 

Qy 903 CAGTGCAACCACACCATCATGACCATTGGCACCTTCGGCTTCTGGGCCGCCTACCTGGCT 962 

N I I I I I I I I I I M I I I I II II I I I I I I I I II I M I I 

Db 949 CAGTGTAACCACACCATCATGACCATTGGGACGTTGGGGATCTGGGCCGCATACCCCTCG 1008 

Qy 963 GGTGGA 968 

Nil 

Db 1009 TGTTGA 1014 

RESULT 6 
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'^^^A'^^ ocon^n ivcauita x^cuuia lui r^yyii^auyjii ^jyo^^^\j<^ miu ocan^n ivcauii ua-v/:7-o^*T... r 



US-11-266-748A-210852/C 

; Sequence 210852, Application US/11266748A 

; Publication No. US20060134663A1 

; GENERAL INFORMATION: 

; APPLICANT: Harkin, Paul 

; APPLICANT: Johnston, Patrick 

; APPLICANT: Mulligan, Karl 

; TITLE OF INVENTION: Transcriptome Microarray Technology and 

; TITLE OF INVENTION: Methods of Using the Same 

; FILE REFERENCE: 55815-0102 (319189) 

; CURRENT APPLICATION NUMBER: US/11/2 66, 748A 

; CURRENT FILING DATE: 2005-11-03 

; PRIOR APPLICATION NUMBER: EP 04105479.2 

PRIOR FILING DATE: 2004-11-03 
; PRIOR APPLICATION NUMBER: EP 04105482.6 
; PRIOR FILING DATE: 2004-11-03 

PRIOR APPLICATION NUMBER: EP 04105483.4 
; PRIOR FILING DATE: 2004-11-03 
; PRIOR APPLICATION NUMBER: EP 04105507.0 
; PRIOR FILING DATE: 2004-11-03 
; PRIOR APPLICATION NUMBER: EP 04105485.9 
; PRIOR FILING DATE: 2004-11-03 

PRIOR APPLICATION NUMBER: EP 04105484.2 
; PRIOR FILING DATE: 2004-11-03 
; PRIOR APPLICATION NUMBER: US 60/662,276 
; PRIOR FILING DATE: 2005-03-14 
; PRIOR APPLICATION NUMBER: US 60/700,293 
; PRIOR FILING DATE: 2005-07-18 
; NUMBER OF SEQ ID NOS : 483996 
; SOFTWARE: Patentin version 3.3 
; SEQ ID NO 210852 
LENGTH: 1000 
TYPE: DNA 

ORGANISM: Homo Sapiens 
US-11-266-748A-210852 



Query Match 16.5%; Score 210; DB 8; Length 1000; 

Best Local Similarity 69.1%; Pred. No. 3.7e-42; 

Matches 318; Conservative 0; Mismatches 135; Indels 7; Gaps 2; 



Qy 


365 


CCTGGCCCCCGTGTTCCGCAT-CACGCTGCCTGTCCTGGCGCCCGAGGTAGACAGGCACG 
1 1 1 M I 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II II III 
CCTGGCCCCCATCTTCAGAATCCACCCTGCCGGTGGTGCAAAGCGCCACGTCCAGCAGGA 


423 


Db 


454 


395 


Qy 


424 


CTCCTTGGCGGGAGCTGGAGCTTCACGACTGGATGTCCGAGGATTATGCCCACTTAAAGG 

IN III lllllll Mill II Mill II 

TCCCCTGGCAGAACTACCACCTGAACGTCTGGATGGAGGAGGAATACCGCCACATCCCGG 


483 


Db 


394 


335 


Qy 


484 


AGCCCTGGCTGAAGCTCACCGGCTTCCCCTGCTCCTGGACCTTCTTCCACCACCTCCGGG 

1 1 1 M 1 II M II 1 1 1 II II II 1 II II M 1 1 II 1 1 

GGGAGTACGTCCGCTTCACCGGCTACCCCTGCTCCTAGACCTTCTACCACCACCTCCGCC 


543 


Db 


334 


275 


Qy 


544 


AGCAGATCCGCAGCGAGTTCACCCTGCACGACCACCTTCGGCAAGAGGCCCAGGGGGTAC 

i 1 1 1 1 1 1 1 1 1 M M 1 M M 1 II II II M II 1 II 1 1 II 1 II 1 II 1 III 

AGGAGATCCTCCAGGAGTTCACCCTGCACGACCACGTGCGGGAGGAGGCCCAGAAGTTCC 


603 


Db 


274 


215 


Qy 


604 


TGAGTCAGTTCCGTCTACCCCGCACAGGGGACCGCCCCAGCACCTTCGTGGGGGTCCACG 
III 1 1 1 1 1 Mi 1 II 1 1 1 1 M 1 II II 1 1 II II II 1 
TGCG - GGGCCTGCAGGTGAACGGGAGCCGGCCGGGCACCTTTGTAGGGGTCCATG 


663 


Db 


214 


161 


Qy 


664 


TGCGCCGCGGGGACTATCTGCGTGTGATGCCCAAGCGCTGGAAGGGGGTGGTGGGTGACG 


723 
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ov^v^Avu. ocaiMX xvcduiLs i^cioiid lkji r^\jyn^aii\Jii yjyo^^^Kji^ oiiu ocaiv^ii xvcauii ua-v:7-ot-r... x a^c i-^ oi 



Db 160 TTCGCCGAGGGGACTATGTCCATGTCATGCCAAAAGTGTGGAAGGGGGTGGTGGCCGACC 101 

Qy 724 GCCGTTACCTCCAGCAGGCTATGGACTGGTTCCGGGCCCGATACGAAGCCCCCGTCTTTG 783 

I I I I I i I I I M I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I 

Db 100 GGCGATACCTACAGCAGGCCCTGGACTGGTTCCGAGCTCGCTACAGCTCCCTCATCTTCG 41 

Qy 784 TGGTCACCAGCAACGGCATGGAGTGGTGCCGGT^GAACAT 823 

I I I I I I I I I I M I I I I I I I I I I I I III I M I I II 

Db 40 TGGTCACCAGTAATGGCATGGCCTGGTGTCGGGAGAACAT 1 



RESULT 7 

US-11-2 66-748A-369805 

; Sequence 369805, Application US/11266748A 

; Publication No. US20060134 663A1 

; GENERAL INFORMATION: 

; APPLICANT: Harkin, Paul 

; APPLICANT: Johnston, Patrick 

; APPLICANT: Mulligan, Karl 

TITLE OF INVENTION: Transcriptome Microarray Technology and 
; TITLE OF INVENTION: Methods of Using the Same 
; FILE REFERENCE: 55815-0102 (319189) 
; CURRENT APPLICATION NUMBER: US/11/266, 748A 
; CURRENT FILING DATE: 2005-11-03 

PRIOR APPLICATION NUMBER: EP 04105479.2 

PRIOR FILING DATE; 2004-11-03 

PRIOR APPLICATION NUMBER: EP 04105482.6 
; PRIOR FILING DATE: 2004-11-03 

PRIOR APPLICATION NUMBER: EP 04105483.4 
; PRIOR FILING DATE: 2004-11-03 

PRIOR APPLICATION NUMBER: EP 04105507.0 
; PRIOR FILING DATE: 2004-11-03 

PRIOR APPLICATION NUMBER: EP 04105485.9 
; PRIOR FILING DATE: 2004-11-03 
; PRIOR APPLICATION NUMBER: EP 04105484.2 
; .PRIOR FILING DATE: 2004-11-03 
; PRIOR APPLICATION NUMBER: US 60/662,276 
; PRIOR FILING DATE: 2005-03-14 
; PRIOR APPLICATION NUMBER: US 60/700,2 93 
; PRIOR FILING DATE: 2005-07-18 
; NUMBER OF SEQ ID NOS : 483996 

SOFTWARE: Patentin version 3.3 
; SEQ ID NO 369805 
LENGTH: 807 
TYPE: DNA 

ORGANISM: Homo Sapiens 
US-11-266-748A-369805 

Query Match 8.0%; Score 101.6; DB 8; Length 807; 

Best Local Similarity 63.5%; Pred. No. 2.2e-15; 

Matches 188; Conservative 0; Mismatches 104; Indels 4; Gaps 2; 



Qy 243 GGGACCTGGACTATTTACCCGGATGGCCGGTTTGGGAACCAGATGGGACAGTATGCCACG 302 

I I I I II II I 11 I I I I I II I M I I I I I I II I I I I I I I I I I I I I 

Db 266 GGGATGTGGACGATCAATGCAATAGGCCGCCTGGGGAACCAGATGGGCGAGTACGCCACA 325 

Qy 303 CTGCTGGCCCTGGCGCAGCTCAACGGCCGCCAGGCCTTCATCCAGCCTGCCATGCACGCC 362 

III I I I I I M I I I I I I I I I M I I I I I I I I I I I I I MINI I 

Db 326 CTGTACGCCCTGGCCAAGATGAACGGGCGGCCCGCCTTCATCCCGGCCCAGATGCACAGC 385 
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Qy 363 GTCCTGGCCCCCGTGTTCCGCATCACGCTGCCTGTCCTGGCGCCCGAGGTAGACAGGCAC 422 

I I M I I I I I I I I I I I I I I I I I I I I I I I I I I II I I II 

Db 386 ACCCTGGCCCCCATCTTCAGAATCACCCTGCCGGTGCTGCACAGCGCCACGGCCAGCAGG 445 

Qy 423 GCTCCTTGGCGGGAGCTGGAGCTTCACGACTGGATGTCCGAGGATTATGCCCACTTAAAG 482 

I I I I II 1 t III I I I II I II I M I I I I I I I I I I I 

Db 446 ATCCCCTGGCAGAACTACCACCTGAACGACTGGATGGAGGGAGAATACCGCCACCTCCCG 505 

Qy 483 GAGCCCT— GGCTGAAGCTCACCGGCTT— CCCCTGCTCCTGGACCTTCTTCCACC 534 

II I II I II I II I I I III I I I I I I I I I I I I I I I I I I I 

Db 506 GGGGGATTACGTTCCGCTTCACCGGCCTACCCCTGGCTCCTAGACCTTCTACCACC 561 



RESULT 8 

US-11-266-748A-453184/C 

; Sequence 453184, Application US/11266748A 
; Publication No. US20060134663A1 
; GENERAL INFORMATION: 
; APPLICANT: Harkin, Paul 
'; APPLICANT: Johnston, Patrick 
; APPLICANT: Mulligan, Karl 

; TITLE OF INVENTION: Transcriptome Microarray Technology and 

; TITLE OF INVENTION: Methods of Using the Same 

; FILE REFERENCE: 55815-0102 (319189) 

; CURRENT APPLICATION NUMBER: US/11/2 66, 7 48A 

; CURRENT FILING DATE: 2005-11-03 

PRIOR APPLICATION NUMBER: EP 04105479.2 
; PRIOR FILING DATE: 2004-11-03 
; PRIOR APPLICATION NUMBER: EP 04105482.6 
; PRIOR FILING DATE: 2004-11-03 

PRIOR APPLICATION NUMBER: EP 04105483.4 
; PRIOR FILING DATE: 2004-11-03 
; PRIOR APPLICATION NUMBER: EP 04105507.0 
; PRIOR FILING DATE: 2004-11-03 
; PRIOR APPLICATION NUMBER: EP 04105485.9 
; PRIOR FILING DATE: 2004-11-03 
; PRIOR APPLICATION NUMBER: EP 04105484.2 
; PRIOR FILING DATE: 2004-11-03 
; PRIOR APPLICATION NUMBER: US 60/662,276 
; PRIOR FILING DATE: 2005-03-14 
; PRIOR APPLICATION NUMBER: US 60/700,293 
; PRIOR FILING DATE: 2005-07-18 
; NUMBER OF SEQ ID NOS : 483996 
; SOFTWARE: Patentin version 3.3 
; SEQ ID NO 453184 
LENGTH: 807 
TYPE : DNA 

ORGANISM: Homo Sapiens 
US-11-266-748A-453184 

Query Match 8.0%; Score 101.6; DB 8; Length 807; 

Best Local Similarity 63.5%; Pred. No. 2.2e-15; 

Matches 188; Conservative 0; Mismatches 104; Jndels 4; Gaps 2; 

Qy 24 3 GGGACCTGGACTATTTACCCGGATGGCCGGTTTGGGAACCAGATGGGACAGTATGCCACG 302 

I > i I I Mill I I I I I I I I I Ill I 

Db 542 GGGATGTGGACGATCAATGCAATAGGCCGCCTGGGGAACCAGATGGGCGAGTACGCCACA 483 

Qy 303 CTGCTGGCCCTGGCGCAGCTCAACGGCCGCCAGGCCTTCATCCAGCCTGCCATGCACGCC 362 

IN Mill II I llllllllll I II I 

Db 482 CTGTACGCCCTGGCCAAGATGAACGGGCGGCCCGCCTTCATCCCGGCCCAGATGCACAGC 423 
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Qy 363 GTCCTGGCCCCCGTGTTCCGCATCACGCTGCCTGTCCTGGCGCCCGAGGTAGACAGGCAC 422 

I I I M I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I 

Db 422 ACCCTGGCCCCCATCTTCAGAATCACCCTGCCGGTGCTGCACAGCGCCACGGCCAGCAGG 363 

Qy 423 GCTCCTTGGCGGGAGCTGGAGCTTCACGACTGGATGTCCGAGGATTATGCCCACTTAAAG 482 

II M I I I I III II I I I I I I II I I I I II Mill I 

Db 362 ATCCCCTGGCAGAACTACCACCTGAACGACTGGATGGAGGGAGAATACCGCCACCTCCCG 303 

Qy 483 GAGCCCT — GGCTGAAGCTCACCGGCTT — CCCCTGCTCCTGGACCTTCTTCCACC 534 

II I II I I I M I I I I III I II I I I I I I I I I I I I I I M 

Db 302 GGGGGATTACGTTCCGCTTCACCGGCCTACCCCTGGCTCCTAGACCTTCTACCACC 247 



RESULT 9 

US-10-561-201-3/C 

; Sequence 3, Application US/10561201 
; Publication No. US20060211115A1 
; GENERAL INFORMATION: 

; APPLICANT: The Trustees of the University of Pennsylvania 
; APPLICANT: Roy, Soumitra 
; APPLICANT: Wilson, James M. 

; TITLE OF INVENTION: Methods of Generating Chimeric Adenoviruses and Uses For Such 

; TITLE OF INVENTION: Chimeric Adenoviruses 

; FILE REFERENCE: UPN-P3067PCT 

; CURRENT APPLICATION NUMBER: US/10/561,201 

; CURRENT FILING DATE: 2005-12-19 

PRIOR APPLICATION NUMBER: US 10/465,302 
; PRIOR FILING DATE: 2003-06-20 
; PRIOR APPLICATION NUMBER: US 60/566,212 
; PRIOR FILING DATE: 2004-04-28 
; PRIOR APPLICATION NUMBER: US 60/575,429 
; PRIOR FILING DATE: 2004-05-28 
; NUMBER OF SEQ ID NOS : 18 

SOFTWARE: Patentin version 3.2 
; SEQ ID NO 3 

LENGTH: 3 6535 
TYPE: DNA 

; ORGANISM: chimpanzee adenovirus serotype Pan7 
US-10-561-201-3 

Query Match 3.8%; Score 48.6; DB 6; Length 3 6535; 

Best Local Similarity 54.1%; Pred. No. 0.11; 

Matches 99; Conservative 0; Mismatches 84; Indels 0; Gaps 0; 

Qy 773 CCCCGTCTTTGTGGTCACCAGCAACGGCATGGAGTGGTGCCGGAAGAACATCGACACCTC 832 

III I I I I I I III I I I I II II II I I I I M I I I I I II I 
Db 7852 CACCTACGATGTGGAAACCTACACCTGGATGGGGTCCTTCGGCAAGCAGCTCGTCCCCTT 77 93 

Qy 833 CCGGGGGGACGTGATCTTTGCTGGCGATGGGCGGGAGGCCGCGCCCGCCAGGGACTTTGC 892 

I Ill I M M I I 

Db 7792 CATGCTGGTCATGAAACTCTCCGGGGACCCGCCGCTCGTCGAGCTCGCCCACGACCTCGC 7733 

Qy 893 GCTGCTGGTGCAGTGCAACCACACCATCATGACCATTGGCACCTTCGGCTTCTGGGCCGC 952 

N I Ml I I I llllllll III III 

Db 7732 CCTCCAGCTCAAGTGGGACCGCTGGCACGGCGACCCCCGCACCTTCTACTGCGTCACCCC 7673 

Qy 953 CTA 955 

I I 

Db 7 672 CGA 7670 
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RESULT 10 
US-10-561-201-2/C 

; Sequence 2, Application US/10561201 
; Publication No. US20060211115A1 
; GENERAL INFORMATION: 

/ APPLICANT: The Trustees of the University of Pennsylvania 
; APPLICANT: Roy, Soumitra 
; APPLICANT: Wilson, James M. 

; TITLE OF INVENTION: Methods of Generating Chimeric Adenoviruses and Uses For Such 

; TITLE OF INVENTION: Chimeric Adenoviruses 

; FILE REFERENCE: UPN-P3067PCT 

; CURRENT APPLICATION NUMBER: US/10/5 61,201 

; CURRENT FILING DATE: 2005-12-19 

; PRIOR APPLICATION NUMBER: US 10/465,302 

; PRIOR FILING DATE: 2003-06-20 

; PRIOR APPLICATION NUMBER: US 60/566,212 

; PRIOR FILING DATE: 2004-04-28 

; PRIOR APPLICATION NUMBER: US 60/575,429 

; PRIOR FILING DATE: 2004-05-28 

; NUMBER OF SEQ ID NOS : 18 

; SOFTWARE: Patentin version 3.2 

; SEQ ID NO 2 

LENGTH: 36604 

TYPE: DNA 

ORGANISM: chimpanzee adenovirus serotype Pan6 
US-10-561-201-2 

Query Match 3.8%; Score 48.6; DB 6; Length 36604; 

Best Local Similarity 54.1%; Pred. No. 0.11; 

Matches 99; Conservative 0; Mismatches 84; Indels 0; Gaps 0; 

773 CCCCGTCTTTGTGGTCACCAGCAACGGCATGGAGTGGTGCCGGAAGAACATCGACACCTC 832 
III I Milt III I I I I II II I I I I I I II I I I I I t I I 
784 6 CACCTACGATGTGGAAACCTACACCTGGATGGGGTCCTTCGGCAAGCAGCTCGTCCCCTT 7787 

833 CCGGGGGGACGTGATCTTTGCTGGCGATGGGCGGGAGGCCGCGCCCGCCAGGGACTTTGC 892 
I I I I t I I I I I I I I I III I I I I I I I I I I I I I II 
7786 CATGCTGGTCATGAAACTCTCCGGGGACCCGCCGCTCGTCGAGCTCGCCCACGACCTCGC 7727 

893 GCTGCTGGTGCAGTGCAACCACACCATCATGACCATTGGCACCTTCGGCTTCTGGGCCGC 952 

Mill MM III I I I II MM II III III 

7726 CCTCCAGCTCAAGTGGGACCGCTGGCACGGCGACCCCCGCACCTTCTACTGCGTCACCCC 7667 

953 CTA 955 
I I 

7666 CGA 7664 



RESULT 11 

US-11-218-305-13034 

; Sequence 13034, T^plication US/11218305 
; Publication No. US200601414 95A1 
; GENERAL INFORMATION: 

; APPLICANT: MONSANTO TECHNOLOGY, LLC 
; APPLICANT: McLaird, Paul L. 
; APPLICANT: Tao, Nengbing 
; APPLICANT: Wu, Kunsheng 

; TITLE OF INVENTION: Polymorphic Markers and Methods of Genotyping 

; TITLE OF INVENTION: Corn. 

; FILE REFERENCE: 38-21 (53660) B 



Qy 

Db 

Qy 

Db 

Qy 

Db 

Qy 

Db 
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CURRENT APPLICATION NUMBER: US/11/218,305 
CURRENT FILING DATE: 2005-09-01 
PRIOR APPLICATION NUMBER: US 60/606,880 
PRIOR FILING DATE: 2004-09-01 
NUMBER OF SEQ ID NOS : 25043 
SOFTWARE: Patentin version 3.2 
SEQ ID NO 13034 
LENGTH: 6573 
TYPE: DNA 
ORGANISM: Zea mays 
FEATURE : 

NAME/KEY: misc_f eature 
LOCATION: (214) . . (214) 

OTHER INFORMATION: n is a, c, g, or t 
FEATURE : 

NAME/KEY: misc_f eature 
LOCATION: ( 1342 )..( 1342 ) 
OTHER INFORMATION: n is a, c, g, or t 
FEATURE : 

NAME/KEY : misc_f eature 
LOCATION: (3788) . . (3788) 
OTHER INFORMATION: n is a, c, g, or t 
FEATURE : 

NAME/KEY: misc_f eature 
LOCATION: (5035) . . (5035) 
OTHER INFORMATION: n is a, c, g, or t 
FEATURE : 

NAME/KEY: misc_f eature 
LOCATION: (5217) (5217) 
OTHER INFORMATION: n is a, c, g, or t 
US-11-218-305-13034 

Query Match 3.7%; Score 47.4; DB 9; Length 6573; 

Best Local Similarity 48.0%; Pred. No. 0.12; 

Matches 135; Conservative 0; Mismatches 146; Indels 0; Gaps 0; 



Qy 


683 


GCGTGTGATGCCCAAGCGCTGGAAGGGGGTGGTGGGTGACGGCCGTTACCTCCAGCAGGC 
1 1 1 1 1 1 II II II 1 1 1 1 1 1 1 1 1 1 1 III 1 1 II 
GTGTGTGGGTGTGGGGGGGCGGGGTGGGGTGGGGAGTGACACCCGGGGGCCGATGACGAC 


742 


Db 


2582 


2641 


Qy 


743 


TATGGACTGGTTCCGGGCCCGATACGAAGCCCCCGTCTTTGTGGTCACCAGCAACGGCAT 
1 1 1 1 1 1 1 1 1 1 1 1 1 II III 1 1 1 1 1 1 II 1 1 
GCTGCACATGGACCCGCCGGCGCCGCCCGCGCGCCGCTCAGTGTCGACCAGCTGCGACCT 


802 


Db 


2642 


2701 


Qy . 


803 


GGAGTGGTGCCGGAAGAACATCGACACCTCCCGGGGGGACGTGATCTTTGCTGGCGATGG 
1 1 1 1 1 1 1 1 11 1 1 1 II 1 1 1 1 1 1 1 1 1 II 
GCACCCGGGCGAGACGTTCACCGGCTTCTGCGCCGCGTGCCTGCGCGAGCGCCTCCACGG 


862 


Db 


2702 


2761 


Qy 


863 


GCGGGAGGCCGCGCCCGCCAGGGACTTTGCGCTGCTGGTGCAGTGCAACCACACCATCAT 

1 1 1 1 1 1 1 i 1 II 1 III 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 
TCTCGAGGCGTCCGCCGCCGCCGCCTCCGCGCCGGGGCGCAAGTCCACATCGGCCATCCG 


922 


Db 


2762 


2821 


Qy 


923 


GACCATTGGCACCTTCGGCTTCTGGGCCGCCTACCTGGCTG 963 
MM III III 1 1 1 1 1 1 1 1 II 
GTCCCTGTTCGCCAGGCCGTTCGCCGCCGGCAGCTCGTCTG 28 62 




Db 


2822 





RESULT 12 
US-10-561-201-1/C 

; Sequence 1, Application US/10561201 
; Publication No. US20060211115A1 
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; GENERAL INFORMATION: 

; APPLICANT: The Trustees of the University of Pennsylvania 
; APPLICANT: Roy, Soumitra 
; APPLICANT: Wilson, James M. 

; TITLE OF INVENTION: Methods of Generating Chimeric Adenoviruses and Uses For Such 

; TITLE OF INVENTION: Chimeric Adenoviruses 

; FILE REFERENCE: UPN-P3067PCT 

; CURRENT APPLICATION NUMBER: US/10/5 61,201 

; CURRENT FILING DATE: 2005-12-19 

PRIOR APPLICATION NUMBER: US 10/465,302 
; PRIOR FILING DATE: 2003-06-20 
; PRIOR APPLICATION NUMBER: US 60/566,212 
; PRIOR FILING DATE: 2004-04-28 
; PRIOR APPLICATION NUMBER: US 60/575,429 
; PRIOR FILING DATE: 2004-05-28 
; NUMBER OF SEQ ID NOS : 18 

SOFTWARE: Patentin version 3.2 
; SEQ ID NO 1 

LENGTH: 3 6462 

TYPE : DNA 

; ORGANISM: chimpanzee adenovirus serotype Pan5 
US-10-561-201-1 

Query Match 3.7%; Score 47; DB 6; Length 36462; 

Best Local Similarity 53.6%; Pred. No. 0.27; 

Matches 98; Conservative 0; Mismatches 85; Indels 0; Gaps 0; 

Qy 773 CCCCGTCTTTGTGGTCACCAGCAACGGCATGGAGTGGTGCCGGAAGAACATCGACACCTC 832 

III I INI IN I I I I I I I I I I I I I I I I I I I I I I I I 
Db 7840 CACCTACGACGTGGAAACCTACACCTGGATGGGGTCCTTCGGCAAGCAGCTCGTCCCCTT 7781 

Qy 833 CCGGGGGGACGTGATCTTTGCTGGCGATGGGCGGGAGGCCGCGCCCGCCAGGGACTTTGC 8 92 

I I II I I II I I II I I III I II I I I I [ I I I I I II 
Db 7780 CATGCTGGTCATGAAACTCTCCGGGGACCCGCCGCTCGTCGAGCTCGCCCACGACCTCGC 7721 

Qy 893 GCTGCTGGTGCAGTGCAACCACACCATCATGACCATTGGCACCTTCGGCTTCTGGGCCGC 952 

I I I I I I I I I I I I I I I I I I II I I I Ml Ml 

Db 7720 CCTCCAGCTCAAGTGGGACCGCTGGCACGGCGACCCCCGCACCTTCTACTGCGTCACCCC 7 661 

Qy 953 CTA 955 

II 

Db 7660 CGA 7658 



RESULT 13 

US-11-218-305-24825/C 

; Sequence 24825, Application US/11218305 

; Publication No. US200601414 95A1 

; GENERAL INFORMATION: 

; APPLICANT: MONSANTO TECHNOLOGY, LLC 

; APPLICANT: McLaird, Paul L. 

; APPLICANT: Tao, Nengbing 

; APPLICANT: Wu, Kunsheng 

; TITLE OF INVENTION: Polymorphic Markers and Methods of Genotyping 

; TITLE OF INVENTION: Corn. 

; FILE REFERENCE: 38-21 { 53660 )B 

; CURRENT APPLICATION NUMBER: US/11/218,305 

; CURRENT FILING DATE: 2005-09-01 

; PRIOR APPLICATION NUMBER: US 60/606,880 

; PRIOR FILING DATE: 2004-09-01 

; NUMBER OF SEQ ID NOS: 2504 3 
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SOFTWARE: Patentin version 3.2 
SEQ ID NO 24825 
LENGTH: 1158 
TYPE; DNA 
ORGANISM: Zea mays 
FEATURE : 

NAME/KEY: misc_feature 
LOCATION: (4) . . (4) 

OTHER INFORMATION: n is a, c, g, or t 
FEATURE : 

NAME/KEY: misc_feature 
LOCATION: (19) . . (19) 

OTHER INFORMATION: n is a, c, g, or t 
FEATURE : 

NAME/KEY: misc_f eature 
LOCATION: (36) . . (36) 

OTHER INFORMATION: n is a, c, g, or t 
US-11-218-305-24825 

Query Match 3.7%; Score 46.6; DB 9; Length 1158; 

Best Local Similarity 48.3%; Pred. No. 0.099; 

Matches 130; Conservative 0; Mismatches 139; Indels 0; Gaps 0; 



Qy 


272 


GTTTGGGAACCAGATGGGACAGTATGCCACGCTGCTGGCCCTGGCGCAGCTCAACGGCCG 
1 1 1 1 1 1 1 1 I 1 1 1 1 1 1 1 1 M 1 1 1 1 Mill III 
GTTCGGGCTCATGTACGCCTACTTTGCGACGGTGATGGACAAAGCGCGGAGGGTCCGCGC 


331 


Db 


564 


505 


Qy 


332 


CCAGGCCTTCATCCAGCCTGCCATGCACGCCGTCCTGGCCCCCGTGTTCCGCATCACGCT 
i III 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II II III III 
CGGCGCCGGAGCGCTGGCGGCGCCGCTCGCCGTGGGGCTCCTGGCGGGGGCCAACGTGCT 


391 


Db 


504 


445 


Qy 


392 


GCCTGTCCTGGCGCCCGAGGTAGACAGGCACGCTCCTTGGCGGGAGCTGGAGCTTCACGA 
II 1 1 1 1 1 1 II 1 1 1 1 III 1 1 1 1 1 1 I 1 1 II 
GGCCTGCGGCGCGCTGGAGGGCGCCGTGATGAATCCGGCGCGCGCGTTCGGGCCCGCCGT 


451 


Db 


444 


385 


Qy 


452 


CTGGATGTCCGAGGATTATGCCCACTTAAAGGAGCCCTGGCTGAAGCTCACCGGCTTCCC 
II IN 1 1 II 1 II 1 1 1 I 1 1 1 1 1 1 1 1 1 1 1 
CGTGGGCTCCCGTCGCTGGAGACACCAATGGGTGTACTGGGTGGGGCCCATGGTCGGCGC 


511 


Db 


384 


325 


Qy 


512 


CTGCTCCTGGACCTTCTTCCACCACCTCC 540 
1 1 1 II 1 1 1 M 1 1 1 1 II 
CGGCCTCTCCGGCGTCGTCTACGAGCACC 296 




Db 


324 





RESULT 14 

US-11-056-355B-63909 

Sequence 63909, Application US/11056355B 
Publication No. US20060150283A1 
GENERAL INFORMATION: 
APPLICANT: Brover, Vyacheslav 
APPLICANT: Alexandrov, Nickolai 

TITLE OF INVENTION: Sequence Determined DNA Fragments and Corresponding 
TITLE OF INVENTION: Polypeptides Encoded Thereby 
FILE REFERENCE: 2750-1590PUS2 
CURRENT APPLICATION NUMBER: US/11/056, 355B 
CURRENT FILING DATE: 2005-02-14 
PRIOR APPLICATION NUMBER: 60/544,190 
PRIOR FILING DATE: 2004-02-13 
NUMBER OF SEQ ID NOS : 119966 
SEQ ID NO 63909 
LENGTH: 1196 
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TYPE: DNA 
; ORGANISM: Triticum aestivuin 
FEATURE : 

NAME/KEY: misc_f eature 
LOCATION: (1)..(1196) 

OTHER INFORMATION: Ceres Seq. ID no. 12616895 
US-11-05.6-355B-63909 

Query Match 3.7%; Score 46.6; DB 9; Length 1196; 

Best Local Similarity 47.4%; Pred. No. 0.1; 

Matches 174; Conservative 0; Mismatches 189; Indels 4; Gaps 1; 

Qy 54 6 CAGATCCGCAGCGAGTTCACCCTGCACGACCACCTTCGGCAAGAGGCCCAGGGGGTACTG 605 

II I II II I I I I I I III I I I M I I I III I I I I 

Db 609 CATGTCCTCCGATCTCTCCCCCATCCAGTCCGGCTGCTGCAAGCCTCCGATCAGCTGCGG 668 

Qy 606 AGTCAGTTCCGTCTACCCCGCACAGGGGACCGCCCCAGCACCTTCGTGGGGGTCCACGTG 665 

Ml I I I I I II I I II II I I i I I II I I III II I I I II 

Db 669 CTTCACCTACGTCAACAGCACGCAATGGACCGGCCCCGCCAAGTCGACGGAGCCCGACTG 728 

Qy 666 CGCCGCGGGGACTATCTGCGTGTGATGCCCAAGCGCTGGAAGGGGGTGGTGGGTGACGGC 725 

I I I I I I II I I I I I I I II I I I I I II I 

Db 729 CGGCGCGTGGTCCAACGACGGGGCGCTCTGCTACGGCTGCCAGTCGTGCAAGGCCGGCGT 788 

Qy 726 CGTTACCTCCAGCAGGC TATGGACTGGTTCCGGGCCCGATACGAAGCCCCCGTCTT 781 

I I I I I I I I I I I I I I I I I II I I I I I I 

Db 789 GGTGGCCACCCTCAAGCGCAATTGGAAGCGCTCCGCCATCATCAACATCGTCTTCCTCGT 848 

Qy 782 TGTGGTCACCAGCAACGGCATGGAGTGGTGCCGGAAGAACATCGACACCTCCCGGGGGGA 841 

I I II I I II I I I I I I II I I I I I II I I 

Db 849 CTTCATCATCATTGTCTACTCCGTCGGCTGCTGCGCCTTCAGGAACAACCGCCGCGACCA 908 

Qy 842 CGTGATCTTTGCTGGCGATGGGCGGGAGGCCGCGCCCGCCAGGGACTTTGCGCTGCTGGT 901 

I II I II I I I I I I I I I II I I II I I I I I II I 

Db 909 CCGCAACGGCGGCGGGTACAAGCAGCAGGGCGCGTACGCCTGATCGTTTGGCTCGGTTAT 968 

Qy 902 GCAGTGC 908 

I I I I 

Db 969 TTATTGC 975 



RESULT 15 ' ■ 
US-11-218-305-24824/C 

; Sequence 24824, Application US/11218305 
; Publication No. US200601414 95A1 
; GENERAL INFORMATION: 

; APPLICANT: MONSANTO TECHNOLOGY, LLC 
; APPLICANT: McLaird, Paul L. 
; APPLICANT: Tao, Nengbing 
; APPLICANT: Wu, Kunsheng 

; TITLE OF INVENTION: Polymorphic Markers and Methods of Genotyping 

; TITLE OF INVENTION: Corn. 

; FILE REFERENCE: 38-21 ( 53660 )B 

; CURRENT APPLICATION NUMBER: US/11/218,305 

; CURRENT FILING DATE: 2005-09-01 

; PRIOR APPLICATION NUMBER: US 60/606,880 

; PRIOR FILING DATE: 2004-09-01 

; NUMBER OF SEQ ID NOS : 2504 3 

SOFTWARE: Patentin version 3.2 
; SEQ ID NO 24824 
LENGTH: 1217 
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TYPE: DNA 

ORGANISM: Zea mays 
US-11-218-305-24824 

Query Match 3.5%; Score 45; DB 9; Length 1217; 

Best Local Similarity 48.0%; Pred. No. 0.25; 

Matches .129; Conservative 0; Mismatches 140; Indels 0; Gaps 0; 

Qy 272 GTTTGGGAACCAGATGGGACAGTATGCCACGCTGCTGGCCCTGGCGCAGCTCAACGGCCG 331 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I III 
Db 599 GTTCGGGCTCATGTACGCCTACTTTGCGACGGTGATGGACAAAGCGCGGAGGGTCCGCGC 540 

Qy 332 CCAGGCCTTCATCCAGCCTGCCATGCACGCCGTCCTGGCCCCCGTGTTCCGCATCACGCT 391 

I III I I I II I I I I I I I I I I II Ml IN 

Db 539 CGGCGCCGGAGCGCTGGCGGCGCCGCTCGCCGTGGGGCTTCTGGCGGGGGCCAACGTGCT 480 

Qy 392 GCCTGTCCTGGCGCCCGAGGTAGACAGGCACGCTCCTTGGCGGGAGCTGGAGCTTCACGA 451 

II I I I I I I I I I II I III II I I I I I II II 

Db 479 GGCCTGCGGCGCGCTGGAGGGCGCCGTGATGAATCCGGCGCGCGCGTTCGGGCCCGCCGT 420 

Qy 452 CTGGATGTCCGAGGATTATGCCCACTTAAAGGAGCCCTGGCTGAAGCTCACCGGCTTCCC 511 

I I III I III I III I I I I II I I I I II II 

Db 419 CGTGGGCTCCCGTCGCTGGAGACACCAATGGGTGTACTGGGTGGGGCCCATGGTCGGCGC 360 

Qy 512 CTGCTCCTGGACCTTCTTCCACCACCTCC 54 0 

III II I I I I I I I I I I I 

Db 359 CGGCCTCTCCGGCGTCGTCTACGAGCACC 331 



Search completed: November 14, 2006, 02:47:14 
Job time : 353 sees 



SCORE 1 . 3 Build Date : 12/06/2005 
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